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1 CTCTTACTCT 
51 CAATAAGGAG 
101 ATGATGTGGC 
151 TGCTGGTGGA 
201 TGAATACTAG 
251 GAAGTCACCA 
301 TGGGCGAACA 
351 AGCATGCCCT 
401 GGAAGCCTTG 
451 AAACAGTCGG 
501 CAGATGAGAA 
551 CTCCCAGGAG 
601 GTATTTCATT 
651 CCACCATGCC 
701 CCTACGATCT 
751 ACTTCCAAAT 
801 TAGAAGATAA 
851 ATACTCTGGT 
901 CAAGAAAAAT 
951 CCACTGGTTC 
1001 TCTGATGAAT 
1051 ACATTACAAT 
1101 TGCCTACTGC 
1151 GATGTGGCCA 
1201 CCTATTGCCA 
1251 CCCCTCAACG 
1301 TCCTAAATGC 
1351 AAGCCCTTAC 
(SEQ ID NO: 1) 



TCAGCCTGAT 
TCCTTGTGAG 
TGCTTTTAAC 
TTCCTTGATT 
TGAAATCATC 
CTGAAGATGG 
CATGCTAGGA 
GTTTGCAGAC 
GATTCCTTCT 
GGAAACACTT 
ATTCTGGGCC 
TAATAGACTT 
GGACATTCAC 
TGAACTGGCA 
CATTCAAATA 
TCCATAATCA 
GAAAACGAAG 
TGATATGTAG 
ATGAATCAGA 
ATCAGTACAC 
TCAGAGCTTA 
CAGAGTCATC 
TATTTGGGCT 
GGATACTCCC 
GAATGGGAAC 
GATGTTCAGT 
CAATGCATTT 



GTCAAAAGCA 
CAGGTGAAGC 
AACAACTTGT 
TGGAAAATGA 
ATCTACAATG 
GTATATACTC 
GCACAGGTCC 
AATGCCTACT 
AGCAGATGCA 
GGTCAAGAAG 
TTTAGTTTTG 
CATTGTAAAT 
TTGGCACTAC 
CAAAGAATCA 
TCCCACGGGC 
AGGCTGTTTT 
ATAGCTTCTA 
CGAATTTATG 
GTCGAATGGA 
AACATTCTGC 
TGACTGGGGA 
CCCCTATATA 
GGTGGACATG 
TCAAATCAAG 
CCACCTTTGA 
GGAAATCATA 
TACCI 1 1 I IC 



AAAGTTCAGA 
TCATCTAACT 
TTGATCTGTG 
AGTGAATCCT 
GCTACCCCAG 
CTTGTCAACA 
CCGGCCAGTT 
GGCTTGAGAA 
GGTTATGATG 
ACACAAAACA 
ATGAAATGGC 
AAAACTGGTC 
AATAGGGTTT 
AAATGAATTT 
Al I I I IACCA 
TGGTACCAAA 
CCAAAATCTG 
TCCTTATGGG 
TGTGTATATG 
ATATAAAACA 
AATGACGCTG 
TGACCTGACT 
ATGTCCTCGG 
AGTCTTTCAT 
TTTTGTCTGG 
ACCTTTAATG 
AATTTAAAGG 



AGTTCCTCAT 
AGGCATTTCT 
GAACTTTAAA 
GAGGTGTGGA 
TGAAGAGTAT 
GAATTCCTTA 
GTGTATATGC 
TTATGCCAAT 
TATGGATGGG 
CTCTCAGAGA 
CAAATATGAT 
AGGAGAAATT 
GTAGCCTTTT 
TGCCTTGGGT 
GGTTTTTTCT 
GGTTTCTTTT 
CAACAATAAG 
CTGGATCCAA 
TCACATGCTC 
GCTTTACCAC 
ATAATATGAA 
GCCATGAAAG 
AACACCCCAG 
TAGTGCTAAG 
GGCCTTGATG 
AAGGCATATT 
TTGGTTTCCA 



FEATURES : 

5'DTR: 1-100 

Start codon: 101 

Stop Codon: 1286 

3'UTR: 1289 



Homologous proteins: 

Top 10 BLAST Hits: 
CRA 1 18000004922653 /al ti d=gi 
CRA 1 18000004903706 /al ti d=gi 
CRA 1 18000004924799 /al ti d=gi 
CRA 1 98000043616611 /altid=gi 
CRA | 98000043617058 /altid=gi 
CRA 1 98000043616593 /al ti d=gi 
CRA 1 98000043617174 /al ti 6=gi 



17434997 /def=pir| 
1542751 /def=pir|| 
14557721 /def=ref | 
112844223 /def=dbj 
112845127 /def=dbj 
112844194 /def=dbj 
112845372 /def=dbj 



IG01416 lysosomal ... 431 e-120 

S41408 lysosomal ... 430 e-119 

NP_000226.1| lipa... 428 e-119 

I BAB26283.il (AK0... 415 e-115 

I BAB26629.il (AK0... 415 e-115 

I BAB26272.il (AKO. . . 414 e-115 

| BAB26725.il (AKO... 414 e-115 
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CRA 1 98000043617140 /altid=gi 112845298 /def=dbj | BAB26697.il (AK0... 414 e-115 
CRA 1 98000043617224 /altid=gi 112845477 /def=dbj |BAB26766. 1| (AK0... 414 e-114 
CRA 1 98000043616955 /altid=gi 112844939 /def=dbj | BAB26556.il (AK0... 414 e-114 



EXPRESSION INFORMATION FOR MODULATORY USE: 
gi 1 8003062 Stomach normal 
gi (8000757 Stomoach normal 

Tissue expression: 
Human leukocyte 



EST : 

gi 1 8003062 /dataset=dbest /taxon=960 
gi 18000757 /dataset=dbest /taxon=960 



62 4e-07 
54 9e-05 
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1 MWLLLTTTC LICGTLNAGG FLDLENEVNP EVWMNTSEII IYNGYPSEEY 
51 EVTTEOGYIL LVNRIPYGRT HARSTGPRPV VYMQHALFAD NAYWLENYAN 
101 GSLGFLLADA GYDVWMGNSR GNTWSRRHKT LSETDEKFWA FSFDEMAKYD 
151 LPGVIDFIVN KTGQEKLYFI GHSLGTTTGF VAFSTMPELA QRIKMNFALG 
201 PTESFKYPTG IFTRFFLLPN SIIKAVFGTK GFFLEDKKTK IASTKICNNK 
251 ILWLICSEFM SLWAGSNKKN MNQSRMDVYM SHAPTGSSVH NILHIKQLYH 
301 SDEFRAYDWG NDADNMKHYN QSHPPIYDLT AMKVPTAIWA GGHDVLGTPQ 
351 DVARILPQIK SLSLVLSLLP EWEPTFDFVW GLDAPQRMFS GNHNL 
(SEQ ID NO: 2) 

FEATURES: 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION 

N-glycosylation site 

Number of matches: 5 

1 35-38 NTSE 

2 100-103 NGSL 

3 160-163 NKTG 

4 272-275 NQSR 

5 320-323 NQSH 



[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches: 4 

1 125-127 SRR 

2 204-206 SFK 

3 243-245 STK 

4 266-268 SNK 



[3] PDOC00006 PS00006 CK2_PHOSPHOj5r7E 
Casein kinase II phosphorylation site 

Number of matches: 8 

1 53-56 TTED 

2 130-133 TLSE 

3 132-135 SETT) 

4 142-145 SFDE 

5 162-165 TGQE 

6 185-188 TMPE 

7 274-277 SRMD 

8 348-351 TPQD 



[4] PDOC00007 PS00007 TYR_PHOSPH0LSITE 
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Tyrosine kinase phosphorylation site 



161-168 KTGQEKLY 



[5] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 4 

1 14-19 GTLNAG 

2 117-122 GNSRGN 

3 121-126 GNTWSR 

4 175-180 GTTTGF 



[6] RDOC00110 PS00120 LIPASE_SER 
Lipases, serine active site 

167-176 LYFIGHSLGT 

Membrane spanning structure and domains: 



Helix Begin End 

13 23 

2 167 187 

3 248 268 



Score Certainity 
1.398 Certain 
1.637 Certain 
0.715 Putative 



BLAST Alignment to Top Hit: 

>CRA | 18000004903706 /altid=gi | 542751 /def=pi r | | S41408 lysosomal acid 
lipase (EC 3.1.1.-) / sterol esterase (EC 3.1.1.13) 
precursor - human /org=human /taxon=9606 /dataset=nraa 
/length=399 
Length = 399 

Score = 430 bits (1094), Expect = e-119 

Identities = 211/394 (53%), Positives = 274/394 (68%), Gaps = 2/394 (0%) 
Query: 2 mi/clltttclicgtlnaggfudlenevnp 61 

M L CL+ TL++ G V+PE MN SEII Y G+PSEEY V TEDGYIL 

Sbjct: 3 I^FLGLWCLVLWTLHSEGSGGKLTAVDPETNMNVSEIISYWGFPSEE^ 62 

Query: 62 vnripygrtharstgprp\aamq^lfadn^^ 121 

+NRIP+GR + GP+PW++QH L ACH-+ W+ N AN SLGF+LADAG40VWMGNSRG 
Sbjct: 63 LNRIPHGRKNHSDKGPKPWFLQHGLL^SSNWV^ 122 
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Query: 122 ntwsrrhktlsetdek™afsfd^ 181 

NTV/SR+HKTLS + ++FWAFS+DEMAKYDLP I+FI+NKTGQE++Y++GHS GTTIGF+ 
Sbjct: 123 NTWSRKHKTLSVSQDEFWAFSYDEM^KYDL^ 182 

Query: 182 AFSWEUVQRIKMNFALGPTISFKYTTGIF^ 241 

AFS +PELA+RIKM FALGP S + T + LP+ +IK +FG K F + K 
Sbjct: 183 AFSQIPEU\KRIKMFFALGFVASVAFCT5PMAKLGRLFOHLII^ 242 

Query: 242 ASTKICNNKILWLICSER^LWAGSNKK^ 301 

T +C + IL +C L G N++N+N SR+DVY +H+P G+SV N+LH Q 
Sbjct: 243 LGTHVCTHVTLKELCGNLCFLLCGFNERN^ 302 

Query: 302 DEFT^YL^^NDADNMKHYNQSHPPIW 361 

+F+A+DWG+ A N HYNQS+PP Y++ M VPTA+W4GGHD L DV +L QI + 
Sbjct: 303 QKFQARDWGSSAKNYFHYhK^SYPPT^ 362 

Query: 362 LSLVLSLLPEWEPTFDFVWGLDAPQRMFSGNHNL 395 

L S +PEWE DF+WGLDAP R+++ NL 
Sbjct: 363 LVFHES-IPEWE-HLDFIWGLDAPWRLYNKIINL 394 (SEQ ID NO: 4) 



Hmmer search results (Pfam): 

Scores for sequence family classification (score includes all domains): 
Model Description Score E-value N 



PF00561 alpha/beta hydrolase fold 46.7 2.5e-13 2 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 



PF00561 
PF00561 



1/2 
2/2 



112 195 
294 352 



1 71 [. 
139 196 .. 



38.8 6.7e-ll 
8.0 0.19 
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1 TTATGGCCTA ACL I I I I IM CTTTGAGTTA TTTTCAAGAG AAAATTTGAA 
51 AAAGCAGCCT TTGAGGAGAA AGAAGCAATC CAACAAACAA AAAGATAACC 
101 ACACTGTAAT AGGAAATGTG TTTTGAATAG GACATTGGAA GAAAAATAAT 
151 AATCAI 1 1 1 1 ACAGGTAGAT CCCAAAGTCA AGGATCTATG TTCAACCATG 
201 TGTGTTCCAC CATCTTCACA ATTGAATGAG TAACCATCAT TAAGCAGTTA 
251 GCTTAGGCCG TAATATGATT CTTGGACTGA GATTTCAAAA ATACCACAGG 
301 CCTTCTGAAA GGTTACCCCT TTCTAGCTCC ACTATCATCT AATTTTATTA 
351 AAAAAAAAAA AAAAGGAAAA ATTTGAGCTT CTAGAGAGTA GGGGCTACCA 
401 TTTTGTATCC CACAGGGCCA AGGAACAAGT TTTAATGTAT TCATTTAAAT 
451 TAATTTCAGT ATGAGTATTG AAATATATAA TAGAAATATT GTAACATTAT 
501 ATATTTTCTA TATACTTTTA TTATATAGAA AATATATATT ACAGAATATA 
551 TTATTAAATA TTGTAGAACA ATATATAATA CAGAAAAATA TATAATACTC 
601 AGTAATATAT TAAATACTTA TTAAAATAGC AAGCTTATAT AGGAAGAGTG 
651 ATGGAGCATT GTGAGAAAGT TTCAGCTTTA TTTCTTTGAC ATTACTTTGT 
701 TTCTGCACAA ACAAAAGAAT TACAGGAATT GTCCAGATTA TTCAAATAAC 
751 TCGAAGTTGA GGAGGGAATA TAAGTCAATG ATGTAGAAAC TCI I I IAAGA 
801 TTTGAGCTAG CCTACAATCT GTAAAGATCT GTGAAATTGA ACTATATTTG 
851 TGCTATTTCC ATATTAAGTC AAGGCAACAA ATCAATATTA ATAATAATAA 
901 CATAGCACTT CTAGAACTTT •CTAAAGAGTC CAATAAAGTT TTGTTAGAAA 
.951 GGATTGTTTT TGAAGTTAAA AACCATGAGA AATTCCAGGA AAATCCACAT 
1001 ACCTATGCCA TCATACTATC AATCAGGGCA AAACATGCTT GAGTCTTTCA 
1051 TCAAGACTAA ATGATTAAGG AGTGGTACAT AACTTTTCCC TGTTCTGACT 
1101 AGCTGAACAC TTCCTTTTAC TCCACATTTG TTTAATTGGC ATGAAATTTC 
1151 CCACTCCACT AAAACAGATC TTAGGATTTG GACAACACAA AATATCATTT 
1201 GTTTTGAAAG GATTTGAGGA TAAATCCAAA CTAATAGAAC TGAAACTTCT 
1251 ATATTATGCT GGGTAGCAAC TTAGTTTTCC CTACCCTTCT TCATGCTGGG 
1301 AGATGAAAGA GATTCAGTTA CGGCTTAAGC TCCACAGGCA TACAAAGTGA 
1351 AGCAGAAAAC TGAGGCACGT GTGCCTCCAT TATCTGGTAT CTCATGTGGG 
1401 GCTTAGAGGT AAATTGTCGT TATTTGGCCT CCATTTCTGC CTTTAACCAC 
1451 TGGTGTAAAC AAAGGTTACT GTGCCAAAGT TGACAGCAAC CCAAATCCCT 
1501 TTGGCATGTG AATTAGTTTC CTCTGCCATA CTGCTAGTTC CAAATTCCTT 
1551 CTGGTTTCAG GATTTAGGAG TCAGGGTTGC CTCATCTTCT CAAATGAGTT 
1601 ACAGTCACGC ACATCCCTAC ACACTGCATG GTTGGCACTA GTTCCTTGAT 
1651 ATATGTTACT CCGTTTGATC CTCATGAAGG ATCAAATGGG GAAGGGAGAT 
1701 ACTATTGTCT CTGATTGTCC ATTAAGATCT TGAGTATGTT CTACTTCCCT 
1751 GTTTGACACA CTGGTTTGAA AATGTTGCTA AGTCTTCCCA ACAATGACAG 
1801 ATACTCAGTG GAAACATGAA GGATTCCGTC AAACTGGTTA TTTTGCATCA 
1851 TGTAGACCAC TATTTCCCAA CCTGCAAGTG CATCATGGCC TTTGGTGTGT 
1901 CAGGGACACG CCTTGGGTGT GTGTCTCAGT CTAAAGCTTC CTCCTTTTCA 
1951 CAAGCTTCCT GTTTCTCATC TCTCTAGCTT CTAACTGTCA CTGTAATCAT 
2001 CTCTTACTCT TCAGCCTGAT GTCAAAAGCA AAAGTTCAGA AGTTCCTCAT 
2051 CAATAAGGAG TCCTTGTGAG CAGGTGAAGC TCATCTAACT AGGTAAGATG 
2101 AAGATCTATC ATAACCAGGA GGCAGGTTGG AAGGTGCCAG TTGCACTGGC 
2151 AGTCAGGTGC AAGAGCTCTG CAGTGAGGCT GCCTGAGTGT CCATCCTAGA 
2201 TCTCTCACCT CTTGGCTCTG TGACCTTGAG CAGGTCTTAA ATCTCTCTAA 
2251 GCCTTTGTTT TTTTAATTGA TAAAATGAGG ATAATAATAG TACCAAAATT 
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2301 AGGGAGATTT TGVGAGOTA AATAACATAC GTGAACTATT TAGAGTAATG 
2351 CCTGCCATAA GGGGACTCAG TAGCTTATTA TTAGTTTCAT ACAATTTGAA 
2401 AAGTTTCATA ATATTTGCAG ATATAAGATG ATCTTCAACC AGATAGCTAA 
2451 TGTATGCAAA GCTATTTAGC TTCAGAAGTA AACTCTGCAT TTCTAGAAGT 
2501 TAAATATTAC I I IGI IATAG TGAATTATCT GTAATATTTA TCTCTTGCTC 
2551 ACTTTTATAA GAAAAATAGT GAAAGCATTT ATTAAGAACT TACACTGCAC 
2601 TAAATGTTAT ATATGACTTA ATCCTCACTA TAACCCTATG AGATAGGTTA 
2651 CATTATTGTC CTAATTTTAC TAACAAGGAA ACCAAGAGAC AAAGCTACTA 
2701 AAACACTTGC CTGAGGTTAG ACATCTTCTT CTGTGGTGAG GCTGGATTTC 
2751 AAATTTAGAC CATTTGACTG TAGCACTTAT ATGATGAGCA TGCTGTTTAG 
2801 TGTTATAGTG TTGGTCTACC TTTGAATAGA CATACTTTTA AACCATGGCA 
2851 AGGAAGTGAG ACTGCACATT GAAATATGTA AAATTTGCCT TTGGGTGCCA 
2901 CGTGAGAAAT AGTCACATCA CTAGAAACTA ATCATAAGCT TTTGTGTTTG 
2951 GTTAAAGTTT TATTGATCCA TTTTTCTTGT TTACTTTGTG GGATACTGGG 
3001 CTTAACTAGG GGATACCTCC AC I I 1 1 I ACT TGGCCATGGT ATGAAAACCT 
3051 GTCCTCTGAA TCTTTAGATA TTTTGGCAAA TTGTAGGCAA ACAAAGACTT 
3101 AAAGCAATTC AACCTTGATT AAAATAAGAC CAAAAATGCC TCCATACTTG 
3151 ATTAAATTTA TTTCATTTTA GGAACTGGAT TATAATCAAG ACAACTTCTA 
3201 CATGAAAAAA TAGATTAATA GTGGTCCAAG TTAGTTCACT GTATTTATTC 
3251 CTTTTTATAC ATTATCTGCC TTCGGTGTTA TTCAAGTTTT CATTAATCAT 
3301 TAATAATTTC ACTAATCATT TTATTTCATT AATCAACATT GATAGTTAAA 
3351 ATTAATCTGT GAATATTAAA TGTTTTATGC CAGGCATTTC TATGATGTGG 
3401 CTGCTTTTAA CAACAACTTG TTTGATCTGT GGAACTTTAA ATGCTGGTGG 
3451 ATTCCTTGAT TTGGAAAATG AAGTGAATCC TGAGGTGTGG ATGAATACTG 
3501 TAAGTCATGG AAAACTGTGA AGAACATCAA ATAAAGCAGG ACTAATGGAG 
3551 TATGAGGTTA CGAAAGGTCC TGTTGTAACA GAAAATCTCT GATAAAACAG 
3601 ATAAAATGTA GATGGI I I I I AACCTCTGCA AGAGTCAAGC TAGTTAGATC 
3651 TTTGTCTGAA AAACAAATAC TGTCCGGTAA TGAAAACCAA ATTGTGCTAT 
3701 TGTGCTATCT ATCTATCTAT CTATCTATCT ATCTATCTAT CTATCTATCT 
3751 ATCTATCTAT TTATCTATCT ATCTATAGAT AGAACCTCCT CTTTTGAATT 
3801 TATGTTTTAA GAATATCAAG CTATTTGTTG ATATACATGA TTGCCTTCTA 
3851 TTGATCTATA GTTCTATTAC TTTTAAAGCA AGAGGGGTCT CAAAAGACAA 
3901 TTGACTTGAT AATATAGCTT TGTCAGAAAG AATGGGTCAA TGCTAAATTT 
3951 TCCCCCAACC CCCCAAAATA TTAGCCAATA GTAGATATTT TTTAAAATTC 
4001 TACTTATTTT GTATTAAGAC TTTATTTATT AATTTTACAG TTACCTGGTG 
4051 CTACAAATTT CAGATAATTC ACCCTAATAA GCACACAACA GATGGTTTGT 
4101 TTTGATTCCT TTTTATATCC TTTGGAGAAG TTCCACTAAC GACTGTATTT 
4151 TTACTGGGCA GAGTGAAATC ATCATCTACA ATGGCTACCC CAGTGAAGAG 
4201 TATGAAGTCA CCACTGAAGA TGGGTATATA CTCCTTGTCA ACAGAATTCC 
4251 TTATGGGCGA ACACATGCTA GGAGCACAGG TACAAGATAT GTCTCTCCTG 
4301 AAAAGGGGAC TGCATTGACC TCCTGCTTCT CAGGAGGAAT TTAATGCTAG 
4351 ATATGCATCA ACAGAGTTTA TCAAAATTGG TTTGAATTAT TGGATTAGTC 
4401 TTTAAATAGT TATCAGGGAG GCTCACTCTT TGCCTGATAA TTCTCTGAAG 
4451 ACAGACAGGA ACCTAAAAAT ACAAACAGCA AGACTGATCT TGCTAACTGC 
4501 AACCAGAGGT ACTTGTTAGG GTGTAAACAG AAAGGCAGAG CCTGCATTTT 
4551 GTCACCTCAT TACTGATTTA TCATGTGGAA AATTGCTTTG TCCCAGGAAA 



FIG 3B 



Docket No.: CL001186DIV-II 
Serial No.: (to be assigned) 
Inventors: Gennady V. MERKULOV et al. 
Title: ISOLATED HUMAN LIPASE PROTEINS... 

4601 ATGGATCCTC TCATTGTCAG AAGGAGATTT TCTAGGTTGT ATGAAATTGA 
4651 CTCTGGGGCA CCCAAGAAGA ACCTCTCCTG CTCCCACTAA AATTAAQQGG 
4701 CCTCCCTCTG CAGGATAAAA AACAATCTAG TTAAATGACA ACGCATTTCT 
4751 GAAAAGTTTT CCAGGACTGA AAACCTTAAC ATCCACATAC ACTTTGATCT 
4801 AAGGGACAGA CGGTTCATAG AATGAAAGAG TATGGTGTCA ATAAGGCTTG 
4851 AATTCTAGAA TGAGGAGCCA GCCATGCCAT AGCAGGGGAA TGATACTCCT 
4901 TAAAAGGGAA AATTTAACTA CAAATCCTCT GAAGTAGAAA TGATAAGAAT 
4951 AACCAAAATA TCTGCAATGG TTCAATAGCA AATAATTTAT TGGCAGCTGC 
5001 TTACCGTGTT CATTTTGCAT CI I I I I ICCC ACCACACATA TTAAGGAGCA 
5051 GCTGAAGTCA TGTTTGACAT TCTCTCCCTC TTTTATCTCC AGTTTCAGAA 
5101 TGAAAAATGA GAGTGAGATA TGAGTAGTTT TACTAGTTAA AATATGAAAC 
5151 ACCCAGTTAA ATTTGAAGGT CAGATAAACA ACAAATAATT TTGTATAAGT 
5201 CTCATTTTAA GATAATACTA AAAAGTCATT ATTTATTCAC TATTATCACT 
5251 ATTTATAAAA TTTTGTAGAG CATCCTGGAT LI 1 1 I I GOT AC I 1 1 IGI I I 
5301 TTAI I I I I IG CTAAATCTGG CAATCCCAGG CACATGTGTG AAGGAGCTGT 
5351 GAAATATAAA AGGAGAAAAC TTTTATGGGA AAGATTTGGC TTAAGGAGAG 
5401 ATAATTTTGG AAAGATTTAG AATTAAAGAT CATTCATTAG ATGTAATGTT 
5451 CTAAATACTT TATATCAGTT AAACTTCTCA TCAACAATAT GAGATGGGTA 
5501 CCACTAATAG TCACCATTTC ACAAATGATG AAATTAAGGC ACAACCGGTT 
5551 ATGTTAAGAG GCCTAAAGTC CACAAATAGC AAGCTGACAG ACCAGAATTT 
5601 AAGCCCAGGC ATGCTGGCTC CAGAGCCTGT GCTCTTAGTC ATTAAATTAT 
5651 AGTGCCTTAC TTGACCTTCC ACCCTGGTTA CTTTGGATCT CCCTGAATGC 
5701 TCTCTCTCCC TCAGAAATAC TGGAAGTTGG CAGAGGGACA CTGAGCTGAG 
5751 CATATTATTG TAG I I I I IAA ATGCTCTCCA CTGGACAGAA GATGGGGGAT 
5801 TTGAATAGAA ATTTGGTGAG GAACTAATCA GTGTCCATTT ACACTCACCT 
5851 CCTCTTCCTC CCTGGAAGAG CTATAGGACT TGAGTAAGCA TGATAAATTT 
5901 CGTGTCTTTG TAAACCACAC CCAGGAAATT TGTATATACA AATACATAGA 
5951 GCACAGTAGT TATCAGGACA GACTTTGACA TAAAAAGAAC TGGGTTTGAG 
6001 TCCCTGCTCT GGCCTTCTTA TCTGGGTGGC CCTCTGGGAA AGTTACTTAA 
6051 CTACATAAAG I I I IGI I ICC ATATCTACAA AATGAGGTTT CTCAAAATAG 
6101 CAGCTAGTTT ATAGAGTTGT TGCAAGAATT TAGTAAGCTA ATACATATAA 
6151 ATACGTCAAC ATAGCACCAG GTACAAAAAT ATGTGCTCAA GAAACTGAAG 
6201 TTACCTGATT ATAATGCTCT ATACTATTGA CAAGGGAAAA GTGAAAACAG 
6251 I I I I IGI I I I ACCATGTGTG TATGTGTGTG TGTCTGTGAT GTTTCCGACA 
6301 TGCTCTATTT AACATAAATT ACTCTCACTC TTTCTCTCTC TCTCTTTCTC 
6351 TTTCTCCCTC TCTCATCTTA CCCTTTCCCC CACCAGGTCC CCGGCCAGTT 
6401 GTGTATATGC AGCATGCCCT GTTTGCAGAC AATGCCTACT GGCTTGAGAA 
6451 TTATGCCAAT GGAAGCCTTG GATTCCTTCT AGCAGATGCA GGTTATGATG 
6501 TATGGATGGG AAACAGTCGG GGAAACACTT GGTCAAGAAG ACACAAAACA 
6551 CTCTCAGAGA CAGATGAGAA ATTCTGGGCC TTTAGGTAAA TATTAGCTAA 
6601 GAAAACTCAA GGGGGAAATT GGAGGCAATT TTAAAAAAAT AACGTGGACG 
6651 CTATTAATGA TTATCTTTGA CGCTTGAAGT CATATAGCTC CTTGTAGTTT 
6701 CTGTTAAGAT CTCAAAGGAG GGTAACAGCA AGAAGCTCTG ATTTTTCACT 
6751 GATTCTCCCA CAAGCAAAGT ATGGCATTTC AACAAGATCA I I I I IACATC 
6801 CAATTCTGTG AATTCTATGC ATTAAAAGTA TGTCCAAAGA GACAGCTCAG 
6851 GAAATTATCA TGACCAATGT GCACATTCAT TCAGCCAATG TTTACTGAGT 
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6901 GGCTACTGTA TGCGCTGTTC TAGGCCCCGA ACATTCAAAC AGGGAACAGA 
6951 CAAACTCTGA CCTCACAAAG CTTATGTTCA TTTTAGTGAT AATTTTACAA 
7001 GTCATTGCTC CTGGATTGCC AATCAACTGT GTAAAGATGA TTTGGACCAG 
7051 GACCTTATTG ATTTAGAGAA ACTGTGATTG ATTTAGAGAA ACTGAGATCG 
7101 CACATAGTAC CATTTTCAGG AAAACTCCAA TATTAGATTT TTAAAACCTT 
7151 GTTAATGGGC AATGAAGAAG AATCI I I I I I GATATCTTGT TTCTTTTAAT 
7201 GGAAGAGTTT TCTGCTGTCA CCAGAGGACA GGCTGATGCC TGCGATAGAC 
7251 TTTTCTTTCT TCAGGCCTAA GCTCCCTGTT GGTTTGTAAA CCTGATGCTA 
7301 GAACAGACTG TGTATTCCTA TTACATTAAT AAAACATTCA GTACCCACTG 
7351 AAAGTTTGAG AATAGTGGAG GAATAGAATA GAATGTTATA GTCTGAGTTC 
7401 TTGGGCAGGG GCAAGCATCA GGAAATATTG AATCATTAGT CTTTAGGAGG 
7451 TGTCACAACA ATTCTCCTAT TCTTGTAAGT CCCAATCTAT AGATTTCCTC 
7501 ACATGTTCTT TTAATAAACA GGCTTCTAGC TTATGGAATA CCTGATTTGA 
7551 CTAAATGTTA TATAGGCCCT I I IGI ICCTC CTGTCTGAAG AACAAAATAC 
7601 TAGTACTATG GAATATTGGT ATATATTAAA TATATATCTA TATATCCATG 
7651 TGGACAGGAA TACTACTACT AACAACATCT TACTGAGCAC CGACTGGCAG 
7701 CCAGAGTCGT TTCTTTCATA CTATTAAACC CCGTTAGCAG CCCCGTAAAC 
7751 CAGGTACTAC CCTGTTTATT TCCCAAATGA GAAAAGATAG GGTCAGAGCA 
7801 TTTCAGTAAT TTCTCAAGAG TTGCAAAGGC CATAAATAGT . AGAATCATGA 
7851 TTTACAAAAC CCCTGTTTCC AAAGATGGGT ATTAAATGGT CCTAACAATT 
7901 GTGAAGCCTC ATGTGGGAGT CAGAAGTAGA GGCACACAAG CCAGATGGGG 
7951 AAAGGGAGGG CAAAGAAAAG CAAGAGAAGG GAAGGAAGAG GAGGGATCAT 
8001 AAGGTTGAAC TTCAAATATC ATACACAAGT TTCGAAAGTG TTCCTCTTAT 
8051 AAGGAAGTAA AATGTACATA TGCAGAAAAA CAAAAAGCTA CAATAGCCTA 
8101 CATATAATTG GATAAATAAT GAAATACACA TTGAATCTAA GTAAACAGCA 
8151 TAGAATCTGG GTGTAAAAAA GAAGTGAGCA AGTGCTCTGA GTTTTAAACT 
8201 TAAACTTGCA AGTATTTATA AAAGCCCCTG TTTTATTTTG CAGTTTTGAT 
8251 GAAATGGCCA AATATGATCT CCCAGGAGTA ATAGACTTCA TTGTAAATAA 
8301 AACTGGTCAG GAGAAATTGT ATTTCATTGG ACATTCACTT GGCACTACAA 
8351 TAGGTATGTT TATGAGGGTC ACTGTTAGGT GTGTTTTTGA GGGTCAGTTT 
8401 TCTCAGAGTC TTACAGGAGT TCACCTTTAT GTTGGAATAA AACAACTGTT 
8451 ACTTATAGTG CCCTCAATTC CCTGTCCTCT GCTGGGAATA ACCCTAGTAC 
8501 TCTAAGTAGC TGTGAGCCTG CAGTGCACAG ACTATATGTA GGGCAAACCT 
8551 TTCCTGGGTC TCTGGTCACA GCAGCATATT GACTACGGTG ATGCAATTTC 
8601 CGAGGAATAA CATGTGTTCC AAATTCAAAG AAATAATTCC ACAGAGTAAG 
8651 TTTCTAGATT CCCTCTGAGC TGAAAAAGTA AAATTCAATG CCATGGAATA 
8701 TGGCTGAAAC ATAATAAATG TGCATCAATC ATCTCTTTCT CACAACCCAA 
8751 ATGGGATTTT TAAAAAATAA AAGGGAAGGG CTTATACCTA TATTTAAACA 
8801 AATTGAAAAG GCATGGTTAT ATTTGTTTGT GAGTTGGAAC ACACAAGCTT 
8851 ACTATAATAA ATCAATTGAG CTTATCTATT CAGTGTGTGA TTTAGTATTT 
8901 ATGAAATAGC AAGTAAATGT AAGCACTATG TAGAAATTTC TAAAGI 1 1 I I 
8951 TAAGCTGACA ACTTACTTCT TAATTTACTT ACTTTACTTA ATTTACTTTA 
9001 CAATTTACTT TCCAGGTATT TTGGAAAGAA ATCAATAATC TAGTTCCAAG 
9051 TAAAAGTTGA AAGGAACCCA CACTAATAAA AGCTTTGAAT TTGTCATTGA 
9101 ACTTCCACTA AAGTTTCCAA TTTTAAGAGA ATAAATCATG TGAAAGTGCA 
9151 ATATTTCAGT TTAGGGAAAT ATTTTCATTA TCACCACTAT CATCAGTAAC 
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9201 AAACATATAT TCATTAGTAT TTTAGATTGA CAGGCACTTT CCAAGCTCAG 
9251 AACAGGCAGT TAGCATCAGT CAGCATATAC TAAAAAAGTA TCAAAGAACT 
9301 CATAGGAGAT CAAAAATGCC ACCAATAGGC AAATAATTAC AGTATCTAAC 
9351 ACTTATTGAG CATTCGTTAT GTGTAGGGTC TTGTGTTCAG GACCTTCCCC 
9401 ACAGTATCTC CCTCTGATCT TCAAAACAAC CCGAATGTTA TTATCCCCAT 
9451 CTCATAGAAG AAGAAACACA AGTTCAGAAC ACAGATTCAA ACCAGATGTA 
9501 TCTGATTTCA CCAATAGGGT GTGTAAGGAT TCCGGAGAAA TGGTGTAGAG 
9551 AAGAAGAAAT GACTTTAGTT GGTTTTGGAA AGTGGGTAGG ACTTAGATAT 
9601 GCTCTTATAC TTGATCTGCA AAAAAAAAAA AAAAAACCAT GGAGAATTTG 
9651 ATTATCTGTG CTCTGTGTTT CATTTAGGAC ATAAATATTT TTAGTGACTG 
9701 TTGTTTGCAT TTTGGACAGA GCAATTTCTG TTATGTAAGG AGCACCCACT 
9751 CTTTGTAGGA CATTTAGTAG GTCCCAGCCC ATTAAACAGG GCTCTGCAGT 
9801 CAGCGTGACC CTCAAAAATC TCACCTCCAC ACATTTCCAA ACACCCTCTG 
9851 GGGAAGTACT ATTCCTGATT CAGAGTCTTT TTATCAATTG TTCAGTCAAT 
9901 TATTTCAGTT CTTCTTTTTC TGGCCAAGAC AGTTTTAATG TTCCAACAAG 
9951 TGTTTCAGTA CACACATACA CACACACACA CACACACACA CACACACACA 
10001 CACATGCTAG TGGAGGCCCA GGAAGGGACC TCTGGAAACC AAATTATATG 
10051 GATATTCTCC CTAGCCTACC CAGTGTTGTG CTAATCTCCA TCCTCACAGA 
10101 TATACAAAGG GGTGCAATGC TACTGCTGAA AGAGCAAAGC AAATGGAGAT 
10151 GCCTGGTCCT TACTGGGCCA TCGTGGATGC TAGGGAAAGC CCCTTTCTTT 
10201 TTGGAAACAG GGAAGAGTCT AGAGGGTTGA AAAACACCCA GTAAGACACT 
10251 GGGAGCAGTG AAATTTCATT CCATAGTGAG AAAGAAAACC TGTTAGAATA 
10301 ACTGGGTGAT GCTGCAGAAA GAAATCAATT CACCTCCTGT GACTGATTAT 
10351 TTGCTTCTGG AAGCTCTGTG ATTCATTCTG GCATCTCAGA GTTAGGGATG 
10401 AAATGAGAAT GTTGCCAGCA TTTACCCCAT GCTTGGGAAG TTTACACAGC 
10451 AGTAGCTACT CCAGCAGCTT AACCATCACC TTTCCCCTGC CAACTACTCC 
10501 ATTTCCCCCA ATCAAGTCAA ACTGTCCATA AATAGAATAA AATAAAATTG 
10551 GAGACTTGAG AGCAGAGAAG ACTGAAGGCA GATTATCTTT ATAGAATAAC 
10601 TCAGAAGACT TCCAATTCAT CCCCAGTATG ATCACGATAG AAGGAAAAAA 
10651 TGACTAAGCA GAGCCCCAAT I I IGI IAGAA ACATTGCGTA AGTATTTATT 
10701 TTTACAAGAT TGTCTTATCT CCTGTTCTCT CAGGGTTTGT AGCCTTTTCC 
10751 ACCATGCCTG AACTGGCACA AAGAATCAAA ATGAATTTTG CCTTGGGTCC 
10801 TACGATCTCA TTCAAATATC CCACGGGCAT TTTTACCAGG I I I I I ICTAC 
10851 TTCCAAATTC CATAATCAAG GTAGGCTCCT TTCAACAAAA TGTACCTGAG 
10901 GATCTCATTT TGGATCATAA ATCCTTATTA TTTTCAAATC TACTGTAAAG 
10951 TAAAAGTAGG AAATTTAGAT AAAATCTATA GAACTTAGAC TCTGTGGGTA 
11001 TGTGCTTGTG TATGTGTGTC CCTGCGTGTG CGCATGTCTG TGCCATAGTA 
11051 TCTGCAGGTT CTGTAATACA ATTTACTATA CAAGGTCATC AGCAGGCTGA 
11101 GTATATGTCA GAATTTCTAG CTGAACTGAG TGCTATATGA CAACAAGGAT 
11151 I 1 1 IGI IGI I TTCCCAAGTG I I I I I IGI IC CATTTAGTCA GGTAGGTCAA 
11201 TGAATTCACA TTGCCCAAAT GAAAGACACT TCAAGTTACC CATAATCACT 
11251 GATGTGTCCA ATTTTGACAT TAGAAAAACC TGATTAATAT ATTCCTTCCA 
11301 ATATGGAAAC TTGCCCTAAT AACTAAAGCT AAGATTCCAA AGCCTAAATG 
11351 TATTACAGCT CAAGTATTAA TTCAAATATT TATTGGTTAT TTTTCAGGAG 
11401 TTGAAAAAGT CATTTGGTTG CCAATTGTGG ATTTGGGATT TTATCTATTA 
11451 AAGGGI I I I I I I I I I I I I IC TCTTTGCTTT TGTTTCTCTA CAAAGGTCAT 
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11501 TGCCACAATG AACACAGCAT TTAATCAAAT TCCAGATTGG CCnTGAACT 
11551 TGGGATGATG GATAAAATGG ATTTGGGCCA AAATTGAAGT CAAGGAGACC 
11601 AGTTAGAATA TCAAAATAAT TCATATATAA GAAAATGAGA CGTTGGTTTG 
11651 GGGTAGAGTG GTAGGAATGA AAAAAATTAT TTGTGAGCTA ACACAAGGAA 
11701 TAATTTCCAT AGGGCCTAAT AATAGTTAGG TCTGATAATA CTATGGTCTG 
11751 ATAATAGTTT TATTGTATTG TTTACTGAGA GCACAAATGA TGTAACTTCC 
11801 TTATTCAAGA GCTTTTCTAG TTTATTTAAA AATGTGTTGA CATCAGTTAG 
11851 GTTTTAATGT TTTCTATATT TGGACAGTGT GAGCAAACTA Al I IGI I AAA 
11901 TTAAATTCAG AGAGAGATAC ATCTATCTGT AAATACATAT ATGCGTTGTT 
11951 TGTGTTGCTC TTCCTACATA GGTCAGCTAT AAGGCAAATA ATGTTCCTGG 
12001 GTTATCTCAG TTTCACATTT CCCACTGTCA ATATTCCTGC TACTTTTAAG 
12051 TCCCATATCC TGCTCTTTTC TTCCGTCAGT TTCCCCCAGA AGCTCCAAGA 
12101 CCCCACCAGG AATCCCCATC CAAGTTTACT TTCCCAACTC CTGGAAGTTT 
12151 CAATTGTGCT GCCTTTGTGA CATTATCATA TCTTTTCTGT TCAATGGTTG 
12201 CTTCTCTTTG GCTCACTGTT CTCTACTTTT CAGCCTGAGA GCTGGCTAAT 
12251 CTGGGACAGT ACTCGAATGC AGTGTACACA TGGGTAACAT GGAAAACCCC 
12301 GATTTTCCCT TATATTCAAG GTATTATTTG ACCTTAAGAA AAACTGI I I I 
12351 ACATTTCATA CCAATTAATG AGAAAAAAAT ATTGGCAAGC ACTGACTGGG 
12401 CAGAATACAG GGAAGCTTCA CTATGGAGAA GTGAATTTGG GATTGAGGGC 
12451 CTTTATTGCA ATCTCCTTGT AAATAATATT TGATACTCTT CCTCATCTGG 
12501 AGACACATTC CTAAGTAACT TTTCCTGAAT AATTTGGTCT CCTTGACTGA 
12551 ATCAGTAAGT ACAAATAGAT CCCCAAGCAT GGCTCTTTCC TAGAATGAAA 
12601 GAAATGTCAA GAAGTCTGAA GATGATTCTT GAATTTTGGT I I I I IGCTAT 
12651 TGCTATTTGG GCTTGTTGTC CI IGI IGI IG CTATTGAGTT GAGCTCCTTA 
12701 TATATTCTGG TTACTAATCC CTTGTAATAT GGATAGTCTG CAAATATTTT 
12751 ATCTCATTCA AAGATAATTA TTATTTACTT TCATAGGCTG I I I I IGGTAC 
12801 CAAAGGTTTC I I I I IAGAAG ATAAGAAAAC GAAGATAGCT TCTACCAAAA 
12851 TCTGCAACAA TAAGATACTC TGGTTGATAT GTAGCGAATT TATGTCCTTA 
12901 TGGGCTGGAT CCAACAAGAA AAATATGAAT CAGGTATGTA TGATAATTAT 
12951 AGGGCCATTT GATACCTTAA GAAATTCCAG CTTTCCTTTG ACTCATTTTG 
13001 ATATATCTAT TTACTGTATA AATTCATATG GTATTCCAAA CCCTTAAAGA 
13051 GAGA I I I I I I TTTGCTTTTA AAAATGTTTA TGGGTATATA ATAGTTGT AC 
13101 ATATTTATGA GACACATATA TTTTGATATA AGCATACAAT GTGTAATGAC 
13151 CAAATCAGGG TAATTGGGAT ATCCATCACC TCAAGCATTT ATCAI I ICI I 
13201 I I IGI IAGAG ACATTCTAAT TTGACTCTTC TAGTTATTTT GAAATATACA 
13251 ATGAATTATT GTTAACTATA GTCATCCTAT TGTGCATGCC AGACTTTAGT 
13301 CCTTCTAACG GTATTTTGGT ACCCATTAAC CAATGCCTCT TTATCCTTCC 
13351 CCCACCCCTA CTACCTTTCC CAGCCTCTGG TAACCATCAT TCTTCTCACT 
13401 ATCTCTATAA GGTCAGTTTT I I I I IAAACT CCCCTATATG AGTGAGAACA 
13451 TGCAGTATTT GTCTTTTTGT GCCTGGCTTA TTTCACTTAA TGTAATGTTC 
13501 TCTAATTTCA TCCACATTAT TGCAAATGAC ATGATTTCAT TCI ICI IATG 
13551 GCTGTCTATA TGTACCACAT TTTATTTATC CACTCATCTG TTGATGGACA 
13601 CTTAGGCTGA TTTCATATCT TGGTCATTGT GAATAGTGCT GTACTAAACA 
13651 TGGGGGTGCA GATGTCTCTT CCATGGATTG ATTTCCTTTT I 1 1 I I ICTGA 
13701 ATATAGACCT AGCACTGGAA TTGCTGGATC ATATGGTAAT TCTACTTTTA 
13751 Gl 1 1 I I I GAG GATCCCTCAT ACTCTTCCCC ATAGTTCCTG TACTAATTTA 
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13801 CATTCCTACC AACAGTCTGT GCAAGAGTTC TCTTTTCTCC ACATTCTTGT 
13851 CAGCATCCAT TATTGCCTAT C I 1 1 1 I G ATA AAAGCTATTT TAACTGGAGT 
13901 GAGATAGTAC TTCATTGTAG TTTTAGTTCG CATTTCTCTA ATGATTAGTA 
13951 ATGTTGAACA TTGTTTTTAA TGTACCTCTT GGCTATTTGT ATGTCTTCTT 
14001 TTGAGAAATG TCTACTCAGA TCTTTTGTCC ANN IAAAT CAGAI 1 1 1 1 1 
14051 TTTTGCAATT GAGTTATATG ACCTCTTTAT ATATTCTGGT TACTAATCCC 
14101 TTGTCAGATG GGTAGTTTAC AAATATTTTC TCTCATTCAA CAGGTTCTTT 
14151 AGTTCACTTT GTTGATGGTC TCCTTTGCTT TGCAGAAGCT TTTTAGCTTG 
14201 ACGTAATCTA ATTTGTTCAT GTTTGCTTTG GTTGCCTGTG CATTTGAGGG 
14251 CTTACCTCAA ATTGGCCCAG ACCAATGTCC CGGAGTGCTT CTGTAATGTT 
14301 TGI I I I I I AG TAGTTTCATA GTTTTAGGTC TTAAATGTGT CTTTAATCCA 
14351 TTTTGATTTT Gl I I I IGTAT CTGGCAAGAG ATAGAGATCT AATTTCATTC 
14401 TTCTGCATAT GGATATCTAG TTTTCCCAGC ATCATTTCTT GTGGAAATTG 
14451 TCCTTTGCCC AATGTATGTT CTTGATGCCT TTGTTGAAAA TTAGTTGACT 
14501 ATAAATGTGT GGATTTATTT GTGGGTTCTT TATTCTGTTC CATTGGTCTA 
14551 TGTGTCTGTT TTTATGCCAG TATCATGCAG TTTTGATTAT TACAGGTTTG 
14601 TAGTATAATT TGAAGTCAGG TCATGTGATG CCTCCAGCTT TGI ICI I I I I 
14651 TCTCAGAATC TTATATTTAG AAAAACGTAA AGACTCCAAC AAAAAACCTG 
14701 CTAGAACTGA TAAACAAATT CATTAAATTT GCAGGATACA ACATCAACAT 
14751 ACAAAATTCA GCAGCATTTC AATATGCCAA GAGCAAATAA TCTTAAAAAA 
14801 AAGAAAGAAA AAAAAACAAG AAATAATCCC ATTTATAATA GCTACAAATA 
14851 AAATAAAACA CCTAGGAATA AACCATACCA AAGAAGTGAA AGATTTCTAC 
14901 AATGAAAACT ATAAAACACT GATGAAAGAA ATTGAAAATG ACATTAAAAA 
14951 ATGGAAAGGT ATTCCATGTT CATGGATTGC AAGAATCAAT ATTGTTAAAA 
15001 TGTCCATATG ATCCAAAACA ATCTACAGAT TCAATGCAAT CCCTATCAAA 
15051 ATACCAATGA CATTCTTCAT TGAAATAAAA AAAAAGCCTA AAATTTAAGT 
15101 GGAACCATGA AGGTAGATGT CTGCTATACA TAGAAGATTA AGTACTCAAC 
15151 AAACCTTGAA TATGAAGACT GGGGAAGTGA ATAGGCAGCT TCACTCTTCT 
15201 ATTCCCTGGT GAAATTTAGG AGAATGGATG TTTTATAATG GGTAGCAGTT 
15251 TCTTACATGT TCTCAATCAG CCATAACTTA CTACAGTCAA TTTGAATTTA 
15301 TTGCATTTGA ATATATTGGA TTAAAAATAA AATCCTAAAA AAGGAGAGAA 
15351 GCACATATAA ACCTGCGTCT TATTTCATGT GTTCCTTTCT TTGTGGGTGA 
15401 CI I I IGI I I I GAAATAAAAC CTGCAAAATA ACAGGACAGG GTGGAAGGGA 
15451 GATGGGATCC CCTCTTTATG AAGAAGCAGC AGTCCTGTTT TATCACCTCT 
15501 TCATTTTCTG TTATTGAGAA TTCAAGAAGA AGGAGGAGGA AGAGTTCACA 
15551 TCCACAGACT GGTGTGGTTG AATAGTTGTC TCTACTGTAT TCCAAATAGC 
15601 AGCCAATGAG GCTGTTACAG TGAAGCCAGT CCCAAGATAA TTGTTCTGTA 
15651 CCCCTATTCT CTAAGAAGCT AAATTGTGTT AGACTGAAAC CCATAAGGAA 
15701 CCATTGTTCA AAGTTGGCTT GTTCAAAAGT AAAGAI I I I I AATAGTTTCT 
15751 CTTAATTAGA TTATTTTCTA AGACATAGAA TTATGATTAC TATTTTATCT 
15801 CTATAATTTT CATCTCTATA ACGTTTACAA ATACTGAAAT AACCTTTGGA 
15851 AAAAATTGGC TTTTAGCTTT AC I I I IGCAA TATTTTATTT TATCCCCATA 
15901 AAAGCCTAGG AAATTGGTAC TATGACTTTT AGTATGTTCA TTTAATAGAT 
15951 GAAAACACAG AAACTCAAAG ATGTTAAATA TGGTGGCCAA GTTCACAAAG 
16001 CTGATCATTA ACAACAACAG GGCCTGAACT CCTGGTTTTC TGATTTAATC 
16051 TGTGACAGTG CACCTGGGTG CGCATGCATG CATCACCCCC ACACTTGCAC 
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16101 ATAGAACCTT TCCTAGTTGG CTTTGCTCCA TGATGACCAT TACTGTTCCT 
16151 TCTACTTCAA AATAAGCAAA TTATCCTACA GATTCAGAGC TGGTACAGGT 
16201 GTGCTGTCAA GCAGCCCATT CCATTAGTCA GCTTGTGGTT CACTCACATT 
16251 AAAGTATTGA CCTAAATGGT ATATTTATCT AGATAATTCT ACL I IGI I AT 
16301 TTTCAAAGCC CCAGTCTTGT TTGCTAATTC TGTGCATCAT TTTTCTCTGA 
16351 TTCTGAAAGG CAAAATTTTG TTGGGCAATT GCTGTAATAT GAGTTTTATC 
16401 TCOTTAGAG TCGAATGGAT GTGTATATGT CACATGCTCC CACTGGTTCA 
16451 TCAGTACACA ACATTCTGCA TATAAAACAG GTAGAGTCTT AGTCATGGAA 
16501 AACCATTCCA ATCCTTATTT TCAATATATT TAAAAAGACA GAATTGACCC 
16551 TGTTAACAGG CCTACCCTAA GAATCTTAAG AGCTTGCTTC CAGTTTGTCC 
16601 TTGCTGCCTT CTGTATGCCT TGATTTCCCT GGAATTTAAG AGAAAGGATG 
16651 TTATGGTACA GACCAAGTAG ATGACATAAA TGAACACCAC CTTAAATCAG 
16701 AGTTTTAAAA ATAGGCCCTG AACTGAAGCA AGAGGTAAAC TAGGGAAGCC 
16751 TCAGGAGAAC TGAGACTTCT CCAGAGAGAA GTATCTGGGA TTTAACTTCT 
16801 TTCTAATGAG GCTTGGTTTT CCATGAACTT TTCCTTTAAA CCAAGGGGGG 
16851 TATTGCTCAT CTTTCTGTTG AGCCCCATTT GTCATAATTG TAAAATGGGT 
16901 GGTTACATCC TTCTGGTGAT CTAGGAGCCC TATTTTCGTC CTAGCATACA 
16951 GCAI I I I ICT AAAATTTGCT GTTAGCTTTC ATGATTCTTA CCCTAACTAT 
17001 TCTTTTTCTA AAAAACATTT GTTTCAGCTT TACCACTCTG ATGAATTCAG 
17051 AGCTTATGAC TGGGGAAATG ACGCTGATAA TATGAAACAT TACAATCAGG 
17101 TGAGCTATTT ACAGTAACCC CAGCATGCTG ATTTTGATAA ATTATAATAA 
17151 AAAATTATTT GAGGGTGGAA AGACTCCTAC CTGTCATTTG GTGGCATTTA 
17201 TACTGATAGA ALI I I I I I I I AAAAAAATTT TAATTTTAAT TTTAATTTAT 
17251 TTCAGAAAAT TTATAAATTA AAGAAGCATA TACAAAGAAA CTTACATCAT 
17301 GTGTAATCCT TCCATCCAGA GATAACTAGA TGTACTAACA TTTTGGTGTA 
17351 TTTATTCCAA TTTTCTCAGT ATTATATTGC TTTTAGACAA CTTTTAATCT 
17401 TTCTATTTTA CTTAAGCTAT AGTAAGAGAT AACTAATATA ACTGAGGGAT 
17451 TTTTAAATGC Al I I I IAATG GCTACATAAT AGAAATTATT TCATAAAAAT 
17501 CTTTACAGCA TAAATGAATA TACALI I I I I AATACCAACA GAAAAATTAG 
17551 AATTCCATAT GAAAGTTGAA TAAGTATTAC CCAACATTGA AGACTTGGGT 
17601 CGTAAGGCAT CTTTCTCCAT ATAGCTTTAT GACATAAAAA TCTGTAGCCT 
17651 TGTTTAGCAC CGTACTTTTA ATTAATCCTG TCACCATTTT TCTGTTCTCA 
17701 TAGCCAGGGG CTTGGCTTAT AAGTATGAAC TAAGCAAACT AAATTAAATT 
17751 GTTTTAAGTA TTTTCCCAGG CTATCATATT TTAAGCTATT TACTGGTGCA 
17801 ACTATAGATT ATTAATAAGT TGTTTCTGAG GATCAAAACA ATCAGACTAA 
17851 TCAATTTCTC AATAATGAAT TGGCCTGTTA GAGGAATAAT TCTACTAATC 
17901 CTTAAAACCA CTACAAGAGA TAGACCATGT ATATTTTATT TAI I I I I AAA 
17951 AATAAGTTTA AGATGTGATT TACATACAAG AACATTACTA ATTTTGTGTG 
18001 TCCCATTTAA TAAGTTTTGA CAAATATATT TATTTGTGTA ACCACACCAC 
18051 AATCTAAATA TAGGACGTTT ATATCACCAC TAAAAGTTTT TTTCCTGCTC 
18101 CTGAGACTAT TTATAGACAC AAATGCGTGT ATTTGCAAAT GCTTAGAAAA 
18151 GGTCTAGAAA AAAAAACAGT AAATGTTAAA GTGGTTATCT TCAGAGAGAA 
18201 GAAAGAAGAA AAGAAGTGGA TGGACATGAA ACAGTAAAGG ACCCTCATTT 
18251 TGGACTTTAC ATATGTCTGT TTTCTTCCAT TATTTTGAAT AAACATGCTA 
18301 TATTTATAAA TTATTTACAT TTACAAGAAA ATGAAACAAA ATCAACACGC 
18351 ACATTCAAGA TCATTATGGT CAAGTACTAA AGTATGTGAG AGTGTTAATG 
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18401 TCOTAGAAT TTGGCCACAG TTAGCTGGTC CTACTCTGCT CCAAGCCGGT 
18451 CCTATTTTGT GAATTAATCT CATTTGATGC CAAI I 1 1 IAT TACATTCTCT 
18501 CCAAAAAACr AGTCTCAACA GTTTGCTCTC TCCTCAAGTT CACAGCATTA 
18551 TCTCTGCTAT ATCTATATTT TATTGAGTAT AAGAGAATTA ACCCATGTAA 
18601 GCTCCATGAG GGTAGGGATT TCTCATCGTT TTGTTCACCA GTGTTTTCTC 
18651 ATCTTGAAGA GTACATGACA ATTACTGGGC TCCCAGTATC TATGTGTTGC 
18701 ATTAATGAAA TTTCTTAACT TTAATCTACC TCAAAATGTC TCTATCTTCT 
18751 TGATTCTCTC CTTCCTTTCT CTATCAGAAA ATGATGGTCC TCTTATTTTC 
18801 CAAGTTATTC CGGTCCTGTG CCCTTGATCC CATCTCTTCT CACTTCCCCT 
18851 TCCTTCCTGC CTCCATTCTC CTGTCCCTTA TGAAAAACAA GCAAGACCAT 
18901 CAATTCTATC AAGTTATCAT TATGTCACTC TGTTCTTATC AACATATTTT 
18951 TAGTATTGAA GAGGGCTTCT TCTACTTACT CCTGAACCTT GTACAATGTA 
19001 GTTTAGGTCT TCATCI I I I I ATCATAGCTA CCTTATTTAA AGTCACCCAT 
19051 GGCTTTTAAT TGCCAAATTC AATGGCCTAT CTTCACCTTT TGAAATGTGT 
19101 TATGTTCGTT ACCACAGTCT CCTTGAAACT CAGTCCCCTG ACTTGGACTT 
19151 CCATAACACA ATGATTTCTG ATTTTCCTTC TGTTTGTGAT TGTTCCTTTT 
19201 GTCCCAGGCA CTGGCTACTC CACCTTCCAC CTCTCTGAAA TCATTAGCAT 
19251 TCCCCAAGGA TTCTTCAAAA CTCTCTTTCT TCCTTGGAGA AGTCAGCATA 
19301 GCTTTAATTT GGACCATTTC TATGGCTTAT CTAGAI I I I I TCAGGACTTG 
19351 CCTTCAACCT ATTCTTTCTG TAGGTGATTC CATTAACTGT TGCCCATATG 
19401 GTAGTCCGAA GACAGACCTC CGAGAAATGA CCCTTGTCTC CAAAACTTCC 
19451 GCAATATGTC CAAATTTCCT AGCCTGACAT TCAGACTTTG ATTATCTGCC 
19501 TCCAAGTTTA TATCCTATCA TATTCCTTTA TATATTCTGT TCTCCAGGTA 
19551 CACTGGGAAG CTTGCCATTC CTGATCATAG CCTACAAACT CTTCCTGCCT 
' 19601 CCCACTCACC CTCATCTCTG CTGTCAAAAT GCAACCTTCC CTCAAGAGTC 
19651 ATTTCACAGG ACCCCTCTTT CTATGAAGCC CTCAGGTGGA AATAAI I I I I 
19701 TGCCI I I I I I TCCATTTTAT TTTTGGAGTG TTTATGGCAT TTAACATACC 
19751 TTACTTTGTA TACAAATATT TGCCTTGCTC CCTCTTTTGC AAATTTCTTA 
19801 AAGGTAGAGA CCATTGTATG I I I ILI I CAT ATGTTGCTGG TGCCTAACAG 
19851 AACTATGGCC ATTGTCCACA TTCATTTAGC AGCCTTTGTA GTTATTGCTT 
19901 TGAGGAGCTT CCTCTCATGA ATGCCCTTGC TTTCTCTCCC ACAGAGTCAT 
19951 CCCCCTATAT ATGACCTGAC TGCCATGAAA GTGCCTACTG CTATTTGGGC 
20001 TGGTGGACAT GATGTCCTCG TAACACCCCA GGATGTGGCC AGGATACTCC 
20051 CTCAAATCAA GAGTCTTCAT TACTTTAAGC TATTGCCAGA TTGGAACCAC 
20101 TTTGATTTTG TCTGGGGCCT CGATGCCCCT CAACGGATGT ACAGTGAAAT 
20151 CATAGCTTTA ATGAAGGCAT ATTCCTAAAT GCAATGCATT TACTTTTCAA 
20201 TTAAAAGTTG CTTCCAAGCC CATAAGGGAC TTTAGAAAAA ATGGTAACCA 
20251 ACAATGAGGT TGTCCCCCAG CACCCTGGGG GAGATGCACA GTGGAGTCTG 
20301 TTTTCCAAGT CAATTGTGTT AGTGTTATTT ATGTTTAGAG ACATCTTTGC 
20351 ATGGGACCAT CTACAGGTCC TTATAAACAA TGAGGTAGAT TAGGCAAAAA 
20401 GATAAACAAG TTGCTACTCT ATCTGGCATT TAAGTCTAAT TAAATTGTAA 
20451 I I I I IAGGGC ATACCATGAA GTATAGAAAT GTCTGAAGCT TCAAAGGAAC 
20501 AGTGAAATTC CTTTAAGGTC CTATATGGAA ACCTCTGTTG TCATTTTATT 
20551 TATATGGATT GCTATGGCAA TGGACAGAGT GTGGGATTAG GAGGAGGGCC 
20601 TGTAACTTCT TTATAAAAGT TTCTTAGCTA TCCTGAAGAT GTATAGACAT 
20651 TTTTACTTTT TTAGGTATTT TCAACATCAG AAATTCAAAA AAGTCCCCAA 
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20701 AGATTCTTCC AGAGAAGCCC TCI 1 1 1 <_ I IA CAATCTTATC CCTGGCTATC 
20751 TGCGTAAACG GAATCTTGAA CCCATAATAG GATACATGTA TAAAATCTTC 
20801 CTTATTAAAG CAGAAATAAA TTGTACAGCA TCAATATCAT TTTATAATCA 
20851 TAGGGAGGCT TCTTTGTTTA GCATGTAATG CCCCCTTTAC AGGCTTTTTG 
20901 TTCTTTGAGG GGTTTGAACA TTCCATGAAA AACTGACAGA TAGGAAACTG 
20951 ACAATAAAAG ATTGAGCTAA AGATGGAAGC AGAAAGTACT AGGCTAGATA 
21001 GTCTCTAAAC ATTAAGTATT TTCTTCCTCC ATCTTAAAAG CAATGAGAAG 
21051 CCACCAAAAT ATTTTACCTA ATGGAAACCT GATTGCCGCA TTTTTGTAAC 
21101 CACCACTTTG GCTGCTACAT AGAGAATGGA TTAGAAGATG CCAACAAAAG 
21151 ATTCTGAGCA AGTCTGTAAA TCTGATCAAG TGTTCTGATG CAGGCTGATA 
21201 TCCTTCTGTG CTAAGAGAGA TGATCCTTGG AAAATCCAGA GCCAGCTCCA 
21251 TAATACTTTC CTGCTCTGCT GGCAAATCCA CAAGCTGCTG GCCCCTGGAG 
21301 CCATTCTTCT CTCAAAACTA GCATTCATCA ATTTAATGTA TACGTATTGA 
21351 TGGGGAATAA TGGTCACTAT GAAAACCATG TGATAATATG GAAAAATACC 
21401 CATGATATAA TGTTATGTGA AGAGAAGAAA ATGAAACTGG TAGAACTATG 
21451 TGATTGCAAA TATATACAAA TATTAAAACA ATTATATGAC TTTATAAAAT 
21501 ATTTGTATAT AATGAAAACT GAAGCAATAT AAAAAATAAA ATTAGTTGTG 
21551 TCAGGGTAGT AACATGATGA GTGATTAATA GTTTTTAATT TTTAATATAG 
21601 TAATGACATA ATGTTACAAC TTGTCCAAAT CTCACAAACA TAATATTCAG 
21651 TAAAGGAAGA TAAACATAAA AGAATACATA TTTTATTATA CAI I I I IATG 
21701 TAGGCTAATT GATGGTTCTG AAAGCCTTAA AAAGCTTACT TTTAGGAGGA 
21751 GAATCATGCC TTGGAGGACT CTAGGGTCCA GAAAAATGTC CTAATACTAG 
21801 AGCTAGGTGC AGTCAGATTA ATTATAATAC ATTTCATTAT TTTGTCTGGA 
21851 ATACCAAGAT GACTTCCAAG CAGGAATGGA GTCTAGCAAC ACTTTACTGA 
21901 TGGGGAACTT GGCCACAGAC TTGTAATACA AAI I I I IGGA TATGTTGACA 
21951 ATGTTTCTCC TTATTTTTCT TACTTATACA AAGCAAGAAA TTTGGCTCAC 
22001 AACCTTGAAA CAGACTTACC AGGTTCCTCC AGTTTCCCAA GCCTCAATAT 
22051 CTCATTGCTA TTTTTAA 
(SEQ ID NO: 3) 

SNPs: 



DNA 

Posi ti on Major Mi nor 

165 G A 

226 A G 

231 T C 

359 A 

544 G T 

598 C T 

1621 A G 

2330 C T 

2498 A G 

2791 T C 

2877 T C 
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2879 
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2912 
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3076 
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3745 
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T 
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C T 
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4399 
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4945 
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5280 
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5790 
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5901 
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6457 
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6632 
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6763 
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6955 




T C 


7017 
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7151 
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T 


7308 
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7321 
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7542 
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T 


8597 


T 
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8803 


C 


T 


9016 


G 


A 


9967 


T 


c 


10008 


C 


T 


10363 


G 


A 


10684 


T 


c 


11177 


G 


T 


12345 


T 


C 


12349 


C 


T 


13115 


C 


T 


13354 


T 


A 


13373 


C 


G 


14677 


C 


G 


14734 


G 


A 


14747 


A 


G 


14808 




A 


15086 





A G 


15414 
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15722 
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15861 


T 


C 


16264 


A 


T 


16314 
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A 


16877 


A 


G 
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16966 T G 

17147 A G 

17219 T C 

18628 A G 

18655 T G 

18984 G T 

19407 C T 

19531 T C 

19911 C T 

20199 A G 

20243 G A 

20640 T C 

21156 G C 

21163 A T 

21425 G A 

Context: 
DNA 

Position 

165 TTATGGCCTAACC I I I I I AACTrTGAGTTATTTTCAAGAGAAAATTTGAAAAAGCAGCCT 

TTGAGGAGAMGMGCMTCCMCAMCAAAMGATMCCACACTGTMTAGQ 
TTTTGMTAGGACATTGGAAGAAAAATAATAATCA I I I I IACAG 
[G,A] 

TAGATCCCAMCTCAAGGATCTATGTTCAACCATCTCT 
ATGAGTMCCATCATTMGCAGTTAGCTTAGGCCGTMTATGATTCITGGAC^ 
CAAAMTACCACAGGCCTTCTGAMGGTTACCCCTTTCT AGCTCCACTATCATCTAATTT 
TATTAAAAAAAAAAAAAMGGAAAMTTTGAGCTTCrAGAGAGTAGGCXCT 
TATCCCACAGGGCCMC^GAACAAGTTTTMTGTATTCATTTAA^ 

226 TTATGGCCTAACL I I I I I AACTTTGAGTTATTTTCMC^(WW\TTTGAAAMG^GCCT 

TTGAGGAGAMGAAGCAATCCMCAMCAAAMGATMCCAOXCTGTMTA 
TTTTGAATAGGACATTGGAAGAAAAATAATAATCA I I I I lACAGGTAGATCCCAAAGTCA 
AGGATCTATGTTCMCCATGTGTGTTCCACCATCXrCACAATTGA 
[A,G] 

TGAGTAACCATGATTAAGCAGTTAGCITAGG^ 

AAAMTACCACAOKiCTTCTGAMGGTTACCCCTTTCTAGCTCCACTATCATCT 
ATTAAAAAAAAAAAAAMGGAAAAATTTGAGCTrCTAGAGAGTAGGG 
ATCCCACAGGGCCMGGMCAAGTTTTMTGTATTCATTTAMTTM 
ATTGAMTATATMTAGAAATATTGTAACATTATA 

231 TTATGGCCTAACL I I I I I AACTTTGAGTTATTTTCMGAGAAMTTTGAAAAAGCAGCCT 

TTGAGGAGAMGMGCAATCCMO\MO\AAMGATMCCACACTCT 
TTTTGMTAGGACATTGGMGAAAAATAATAATCAI I 1 1 lACAGGTAGATCCCAAAGTCA 
AGGATCTATGTTCMC(^TGTGrGTTCCACCATCTTCACAATTGAATGAG 
[T,C] 



Docket No.: CL001186DIV-II 
Serial No.: (to be assigned) 
Inventors: Gennady V. MERKULOV et al. 
Title: ISOLATED HUMAN LIPASE PROTEINS.. 



FIG 3L 



Docket No.: CL001186DIV-II 
Serial No.: (to be assigned) 
Inventors: Gennady V. MERKULOV et al. 
Title: ISOLATED HUMAN LIPASE PROTEINS... 

MCCATCATTMGCAGTTAGCTTAGGCC^ 

TACC^CAGGCCTTCTGAMGGTTACCCCTrTCTAGCrCCACTATCATCTM 
AAAAAAAAAAAMGGAAAAATTTGAGCTTCTAGAGAGTAGGGGCTACC^ 
ACAGGGCCAAGGAACAAGI I I I MTGTATTCATTTAAATTAATTTCAGTATGAGTATTGA 
MTATATMTAGAMTATTGTMG\TTATATATTTTCTATATA(XrTTATTATATAG^ 

359 CTTTGAGGAGAMGMGCMTCCMOW^OWW^GATMCCACACrCT 

TGTTTTGAATAGGACATTGGAAGAAAAATAATAATCA I I I I I ACAGGTAGATCCCAAAGT 

CMGGATCTATGTT(^CCATCTGTGTrC<^C(^TCrrCAG\ATTGAATGAGrAACG\TC 

ATTMGCAGTTAGCTTAGGCCGTMTATGATTCrrGGACTGAGAT^ 

GGCCTTCTGAMGGTTACCCCTTTCTAGCTCCACTATCATCT 

[A,-] 

AAAMGGAAAMTTTGAGCTTCTAGAGAGTAGGGGCT ACCATTTTGTATCCCACAGGGCC 

AAGGMG\AGTTTTAATGTATTCATTTAMTTMTTTCAGTAT^ 

ATAGAMTATTGTMCATTATATATTTTC^^ 

TACACW^TATATTATTAAATATTGTAGMG^TATATMTACAGAAAAATATATAATACT 
CAGTMTATATTAMTACrrATTAAMTAG^GCrrATATAGGMGAGTGATGGAGGAT 

544 GCAGTTAGCTTAGGCCGTAATATGATTCT 

TCTGAMGGTTACCCCTTTCTAGCTCCACTATCATCTMTTTTAT^ 
AGGAAAAATTTGAGCTT CTAGAGAGTAGGGGCTACCATTTTGTATCCCACAGGGCCAAGG 
MC^AGTTTTMTGTATTCATrTAMTTMTTTCAGTATGACTATTGAMTATATM 
AMTATTGTMCATTATATATTTTCTATATACTITTATTATATAGAAMTATATATTA^ 
[G,T] 

MTATATTATTAMTATTGTACW^CAATATATMTACAGAAAAATATATAATACTCAGrrA 
ATATATTAMTACTTATTAAMTAGCMGCTTATATAGGAAGAGTGATGGAGCATTCTGA 
GAAAGTTTCAGCnTA I I I <_ I I I GACATTAL I I IGI I I CTGCACAAACAAAAGAATTACA 
GGMTTGrCCAGATTATTCAMTMCTCGMGrrGAGGAGGGMTATMGTCMTGATCT 
AGAMCrcrrTTMGATTTGAGCTAGCCTACMTCrGTAMGAT 

598 AGX£CTTCTGAMGGTTACCCCTTTC^^ 

AAAAAMGGAAAMTTTGAGCTTCTAGAGAGTA^ 

CCAAGGAACAAG I I I I MTGTATTCATTTAMTTMTTTCAGTATGAGrATTGAAATATA 
TMTAGAMTATTGTAACATTATATATTTTCTATATAL I I I I ATTATATAGAAAATATAT 
ATTACAGAATATATTATTAAATATTGTAGM(^TATATMTACAGAAAAATATATAATA 
[C,T] 

TGVGTMTATATTAMTACrrATTAAAATAGCMGCTTATATAGGAAGA&TGATGGAGCA 
TTGTGAGAAAGTTTCAGCnTA I I ICI I I GACATTAL I I IGI I I CTGCACAAACAAAAGA 
ATTACAGGMTTGTCCAGATTATTCAMT^ 
TGATCTAGAMCTCTTTTAAGATTTGAGCTAG^ 
GMCTATATTTGTGXTATTTCCATATTM 

1621 CGGCTTMGCTCCACAGGCATACAMGTGAAGCAGAAMCT 

TATCTGGTATCTCATGTGXiGGCrTAGAGGTAMTTGTCGlTATTTGGCCT 

CrTTMCCACTGGTGTAMOV\AGGTTACTGTGCCAMGTTGAC^ 

TTGGCATGTGMTTAGTTTCCTCTGCCATACTGCTAGTTCCAAATTCOT 
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GATTTAGGAGTCAGGGTTGCCTCATC^ 
[A,G] 

CACTGCATGGTTGGCACTAGTTCCTTGATAT^^ 

TCAMTGGGGMGGGAGATACTATTGTCTCTG^ 

TACTTCCCTGTTTGACACACTGGrrrrGAAMTGTTGCTM 

TACTCAGTGGAMCATGAAGGATTCCGTCAMCT 

ATTTCCCMCCTGCMGTGCATCATGGCCTrTGGTGTGT 

2330 AAAAGTTCAGMGTTCCTCATCMTMGGAGTC^ 

TAGGTMGATGMGATCTATCATMCCAGGAGGCAGGTTG^ 

(^GTCAGGTGCMGAGCTCTGaVGTGAGGCrGCCrGAGrGrCCATCCTAGATCrCTCACC 
TCTTGGCTCrGTGACCrrGAGCAGGTCrrAMTCrCrCTAAGCL I I IGI I I I I I IAATTG 
ATAAMTGAGGATMTMTAGTACCAAMTTAGGGAGATTTTCAGAGCITAM 
[C,T] 

GTGMCTATTTAGAGTMTGCCTGCCATMGGGGACTC^ 
ACAATTTGAAAAGTTTCATMTATTTGCAGATATMGATGATOT 
TGTATGCAMGCTATTTAGCTTCAGAACT 
TTTCTTATACTGMTTATCTCTAATAT 

GAMGCATTTATrMGMOTACACTGCACTAMTGTTATAW 

2498 AGATCTCTCACCTCTTGGCTCTGTGACCrrGAGCAGGTCTTAM 

I I I I I I I MTTGATAAMTGAGGATMTMTAGTACCAAAATTAGGGAGATTTTCAGAGC 

TTAMTMCATACGTGMCTATTTAGAGTMTGCCTGCCATM 

TTATTAGTTTCATACMTTTGAAMGTTTCATMTATTTGC^ 

ACCAGATAGCTMTGTATGCAMGCTATTTAG 
[A,G] 

GTTAAATATTAC I I IGI I ATAGTGAATTATCTGTAATATTTATCTCTT GCTCACTTTTAT 
MGAAAMTAGTGAMGCATTTATTMGMCTTACACTGG^CT 
TAATCCTCACTATMCCCTATGAGATAGGTTACATTATTGTCCTMTTTTACT 
AMCCMGAGACAMGCTACTAAMCACTTGCCTGAGGTTAGACATC I I (J ICTGTGGTG 
AGGCTGGATTTCAMTTTAGACCXrTTGACT 

2791 TTCTAGAAGTTAAATATTAC. I I IGI I ATAGTGMTTATCTGTAATATTTATCTCTTGCTC 

ACTTTTATMGAAAMTAGTGAMGCATTTATTM 

ATATGACTTMTCCT^CTATMCCCTATGAGATAGGTTACATTATTGTCCTMTTTTAC 
TMCMGGAMCCMGAGACAMGCTACTAAMCACTTGCCTGAGGTTAGACATC I IU I 
CTGTGGTGAGGCTGGATTTCAMTTTAGACCATTTGACTGTAGCACTTATAT 
[T,C] 

GCrGTTTAGTGTTATAGTGTTGGTCTACCTTTGAATAGACATACTTTTAM 

GGAAGTGAGACTGCACATTGAMTATGTAAMTTTGCCTTTGGGTG^ 

GTCACATCACTAGAMCTAATCATMGCTT^ 

I I I ICI IGI I I ACTTTGTGGGATACTGGGCTTMCTAGGGGATACCTCCAL I I I I IACTT 
GGCCATGGTATGAAAACCTGTCCTCTGMTCTTTAGATATTTTGGCAMT^ 

2877 ATTTATTAAGAACTTACACTGCACTAMT^ 

TATGAGATAGXJTrACATTATTGTCCTMTTTTACTMCMG 
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ACTAAAACACTTGCCTGAGGTTAGACATL I I CI I CTGrGGTGAGGCrGGATTTCAAATTT 

AGACCATTTGACTGTAGO^OTATATGATGAGC^TGCTGTTTAGTGTTATACTGTTGGTC 

TACCTTTGMTAGACATACTTTTAMCCATGGC^ 

[T,C] 

GTAAMTTTGCCTTTGGGTGCCACGTGAGAMTAGTCACATCACTAGAMCT 
GCTTTTGTGrrTGGrrTAAAGTTTTATTGATCCA I I I I I CI It. I I I ACTTTGTGGGATACT 
GGGCTTAACTAGGGGATACCTCCAC I I 1 1 I ACTTGGCCATGGTATGAAAACCTCTCCTCT 
GMTCXTTAGATATTTTGGCAMTTGTAGGCAMCAMGACTTAM 
ATTAAMTAAGACCAAAMTGCCTCCATACTTG^ 

2879 TTATTMGMCrrACACTGCACTAMTGTTATATATGACTTM 

TGAGATAGGTTACATTATTGTCCTMTTTTACTMCAAGGAMCCAA 
TAAAACACTTGCCTGAGGTTAGACATC 1 1 CI I CTGTGGTGAGGCTGGATTTCAAATTTAG 
ACCATTTGACTGTAGCACTTATATGATGAGCArGCrGTTT 

CCTTTGAATAGACATAC I I I I AAACCATGGCAAGGAACTGAGACTGCACATTGAAATATG 
[T,C] 

aamtttgccrttgggtgccacgtgagamtagtcacatcactagamct 
ttttgtgtttggttaaagttttattgatccai i i i ici igi i i actttgtgggatactgg 
gcttaactaggggatacctccac i i i i i acttggccatggtatgaaaacctgtcctctga 
atctttagatattttggcamttgtaggc^^ 
taaaatmgaccaaamtgcctccatacrrgattaaatttatttcattt^ 

2912 tatgaotmtcctcactatmccctatgagataggttaca™ 

mcmggaaaccmgagacamgctactaamcacrrgcct i i ci ic 

tgtggtgaggctggatttcaaatttagaccatttgactgtagc^ 
gcrgtttagtgttatagtgttggtctacctttgaatagacatac i i i i aaaccatggcaa 
ggaagtc^gactgcacattgamtatgtaamtttgccrtt 

[A,G] 

TCACATCACTAGAMCTMT^TMGCTITTGTGTTTGGTTAAAGI I I I ATTGATCCATT 
I I ICI IGI I I ACTTTGTGGGATACTGGGCTTAACTAGGGGATACCTCCAC I I I I IACTTG 
GCCATGGTATGAAMCCTGTCCTCTGMTCTTTAGATATTTTGGCAMTT 
AMGAOTAMGCAATTCMC(XTGATTAAMTMGACCA 
TAMTTTATTTCATTTTAGGMCTG^ 

3076 CTTATATGATGAGCATGCTGTTTAGTGTTATAGTGT^ 

TTTTAMCCATGGCAAGGMGTGAGACTGCACATTGAMTATCT 
TGCCACGTGAGAAATAGTCACATCAC^^ 

AGTTTTATTGATCCA I I I I ICI I CI I I ACTTTGTGGGATACTGGGCTTAACTAGGGGATA 
CCTCCACI 1 1 I I ACTTGGCCATGGTATGAAMCCTGTCCrCTGMTCTTTAGATATTTTG 
[G,T] 

CAMTTGTAGGCAMCAAAGACTTAMGC^ 
TGCCTCCATACrTGATTAAATTTATTTCATTTTAGGAACTG^ 
TCTACATGAAAAMTAGATTMTAGTGCTCCMGTTAGTT(^CTGTATTTATTCC I I I I I 
ATACATTATCTGCCTTCGGTGTTAT^ 
CATTTTATTTCATTMTCMCATTGATAGTTAAMTTM 
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TGGTGGATTCCTTGATTTGGAAMTGMGTGMTCCTGAGGTGrGGATGAATACTGTAAG 

TCATGGAAMCTGTGMGAACATOW^TAMGG^GGACTMTGGAGTATGAGGTTACGAA 

AGGTCCTGTTGTAACAGAAAATCrCTGATAAAACAGATAAAATGTAGATGC. 1 1 1 1 IAACC 

TCTGCMGAGTCMGCTAGTTAGATCnTGTC^^ 

MCCAMTTGTGCTATTGTGCTATCTAT^ 

[C,G] 

TATCTATCTATCTATTTATCJATCTATCTATAGATAGM 

TTTAAGAATATCAAGCTA I ITG1 I GATATACATGATTGCCTTCTATTGATCTATAGTTCT 
ATTACTTTTAMGCMGAGGGGTCTCAAMG^ 
GAMGAATGGGTCMTGCTAMTTTrCCCCCAACCCCCCAAMTA™ 
TATTTTTTAAAATTCTACTTATTTTGTATTMG^ 

TTCCTTGATTTGGAAMTGMGTGM^ 

AAACTGTGMGMCATCAMTAMGCAGGACTMTGGAGTATGAGGTTACGAAAGGTCCT 

GTTGTAACAGAAMTCTCrGATAAAAGVGATAAAATGTAGATGG I I I I I AACCTCTGCAA 

GAGTCAAGCTAGTTAGATCTTTGTCTGAAAMCAMTACrCT 

TTGTGCrATTGTGCrATCTATCTATCTATCTATCr^ 

[T,-] 

. CTATCTATTTATCTATCTATCrATAGATAGMCCTCCTCITTTGM 
ATATCAAGCTA I I 1 0> I I GATATAC^TGATTGCCrrCrATTGATCTATAGTTCTATTACTT 
TTAMGCAAGAGGGGTCTOW\AGACMT^^ 

TGGGTCMTGCTAMTTTTCCCCCMCCCCCCAAMTATTAGCCMTAGTAGATAI I I I I 
TAAMTTCTACTTATTTTGTATTMG^ 

TGGAAMTGAAGTGMTCCTGAGGTGTGGATGMTACTGTMGT^TGGAAMCTGTGAA 
GAACATCAMTAMGCAGGACTMTGGAGTATGAGOT 

AAMTCTCTGATAAAACAGATAAAATGTAGATGOI I I I I AACCTCTGCAAGAGTCAAGCT 

AGTTAGATCTTTGTCTGAAAMCAAATACTGTCCGGTMTGAAMCCAM 

GTGCTATCTATCTATCTATCTATCTATCTA 

[-,C,T] 

ATCrATCrATCrATAGATAGAACCrCCrCTTTTGAATTTATGTTTT 

atttgttgatatacatgattgccttctattgatctatagttcta™ 
agggctcto\amgagv0tgaot 

cramttttcccccaaccccccaaaatattagcgv\tagtagata i i i i i iaaaattcta 
cttattttgtattaagactitatttattaatttt^ 

aaagcaggactmtggagtatgaggttacgamggtcctgttgtmc^ 
taaaacagataaaatgtagatggi i i i i mcctctgcmgagtcaagctagttagatctt 
tgtctgaaamcaaatacrgtccggtmtgaamccamttgtgctattgtgctatcrat 
ctatctatctatctatctatct^ 

ctatagatagmcctcctcttttgmtttatgttttmgmtatc^ i i i g i i gat 

[A,G] 

TACATGATTGCCTTCTATTGATCTATAGTTCTATTAC^ 
MGAO\ATTGACTTGATMTATAGCTT^^ 

CCCAACCCCCCAAAATATTAGCCAATAGTAGATA I I I I I I AAMTTCTACTTATTTTGTA 
TTAAGACTTTATTTATTMTTTTACAGTTACCT 
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CTAATAAGCACACAACAGATGCj I rTGTTI I GATTCL 1 1 I 1 1 ATATCCT7TGGAGAAGTTC 

4399 GTTTTGATTCL I 1 1 1 I ATATCCTTTGGAGAAGTTCCACTAACGACrGTA I 1 1 I IACTGGG 

(^GACTGAMTCAT(^TCTA(^TGGCTACCCCAGTGMGAGTATGMGTC^CG^CT^ 
GATGGGTATATACTCCTTGTCMC^GMTTCCrrATGGGCGMCACATGCT 
GGTACAAGATATGTCTCrCCrGAAMGGGGACTGCATTGACCTCCTGCT 
ATTTAATGCTAGATATGCATCAAG^GAGTTT ATCAAAATTGGTTTGAATTATTGGATTAG 
[T,C] 

CTTTAMTAGTTATCAGGGAGGCTCACTCTTTGCCTGATMTTCT 
MCCTAAAMTACAMCAGCMGACTGATCrrGCTMCTGCAACCAGAGGTAL I IGI I AG 
GGTGTAMCAGAMGGCAGAGCCTGCATTTTGTCACCTCATTACTGATTTATCATCT 
AAATTGCTTTGTCCCAGGAAAATGGATCCTCTCATTGTCAGMGGAGATTTTCT 
TATGAMTTGACTCTGGGGCACCCMGMGMCCTCTCCTGCrCCCACTAAMTTAAGGG 

4945 MTTGACTCTGGGGCACCCMGAAGMCCrCTCCTGCTCCCACTAAAATTAAGGGGCCTC 
CCTCTGCAGGATAAAAMCMTCTAGTTAMTGACAACGCATTTCTGAAAAG I I I ICCAG 
GACTGAAMCCTTAACATCCACATACACTTTGATCTAAGGGACAGACGGTTCATAGAATG 
AMGAGTATGGTGTCAATAAGGCTTGAATTCTAGAATGAGGAGCCAGCCATGCCATAGCA 
GGGGMTGATACTCC1TAAMGGGAAMTTTMCTACAMTCCTCTGMGTAGAAATGAT 
[A,G] 

AGMTMCCAAMTATCTGCMTGGTTCMTAGCAAATAATTTATTGGCAGCTGCTTACC 
GTGTTCATTTTGCATC I I I I I I CCCACCACACATATTAAGGAGCAGCTGAAGTCATGTTT 
GACATTCTCTCCCTCTTTTATCTCCAGTTTCAGMTGAAAMTGAGAGTGAGATATGAGT 
AGI I I lACTAGTTAAMTATGAMCACCCAGTTAMTTTGMGGTCAGATAAACAACAAA 
TMTTTTGTATMGTCTCAITTTMGATMTACTAAAMGTCA 

5056 GIN ICCAGGACTGAAMCCTTMCATCCACATACACTTTGATCTAAGGGACAGACGGTT 

CATAGMTGAMGAGTATGGTGTCMTMGGCTTGMTTCTAGAATGAGGAGCCAGCCAT 
GCCATAGCAGGGGMTGATACTCCrTAAAAGGGAAMTTTMCTACAMTCCTCTGMCT 
AGAAATGATMGMTAACCAAAATATCTGCAATGGrrT CAATAGCAAATAATTTATTGGCA 
GCTGCrrACCGTGTTCATTTTGCATC I I I I I I CCCACCACACATATTAAGGAGCAGCTGA 
[A,G] 

GTCATGTTTGACATTCTCTCCCTC I I I I ATCTCCAGTTTCAGAATGAAAAATGAGAGTGA 
GATATGAGTAGTTrTACTAGTTAAMTA^ 

AACAACAMTMTTTTGTATMGTCTCATTTTMGATMTACTAAAAAGT 
TCACTATrATCACTATTTATAAAATTTTGTAGAGCATCCTGGATC I 1 1 I IGCTTACTTTT 
Gl I I I IAI I I I I I GCTAAATCTGGCAATCCCAGGCACATGTGTGAAGGAGCTGTGAAATA 

5280 AAATMTTTATTGGCAGCTGCITACCGTGTTCATTTTGCATCI I I I 1 1 CCCACCACACAT 

ATTMGGAGCAGCTGAAGTCATGTTTGACA7TCTCTCCCTGI I I I ATCTCCAGTTTCAGA 
ATGAAAMTGAGAGTGAGATATGAGTAGTTTTACTAGTTAAMTATGAM 
AATTTGMGGTCAGATAAACAAOW\TMTTTTGTATMGTCTCAT^ 
AAAMGTCATTATTTATTCACTATTATCACTATTTATAAMTTTTGTA 
[T,A] 

CI I I I IGCTTACI I I IGI I I I IAI I 1 1 I I GCTAAATCTGGCAATCCCAGGCACATGTGTG 
AAGGAGTTGTGAMTATAAMGX^GAAMCTTTTATG^ 
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ATMTTTTGGAMGATTTAGMTTAMGATCATTCATTAGATGTMTG 

TATATCAGnTTAMGTCTCATCAACAATATGAGATGGGTA^ 

AGVV^TGATGAMTTMGGCACAACCGGITATGITMGAGGCCTAM 

5790 TGAGATGGGTACCACTMTAGTCACCATTTCACAMTGATGAM 

TATGTTMGAGGCCTAMGTCCACA^ATAGCMGCrGACAGACCAGMTT^ 
CATGCTGGCTCCAGAGCCTGTGCTCTrACTCATTAMTTATAGTGC(^ 
CACCCTGGTTACrrTGGATCrCCCrGAATGCTCTCTCTCCCT(^ 
GCAGAGGGACACTGAGCTGAGCATATTATTGTAG I I I I I AAATGCTCTCCACTGGACAGA 
[A,G] 

GATGGGGGATTTGMTAGAMTTTGGTGAGGMCT^ 

CCTCTTCCTCCCTGGMGAGCrATAGGACTTGAGTO 

TAMCCACACCCAGGAMTTTGTATATACAMTACATAGAG 

GACTTTGACATAAAMGMCTGGGTTTGAGrCCCTGCrCTGGCLI I CI I ATCTGGGTGGC 
CCrCTGGGAAAGTTACTTAACTACATAAACI I I IGI I I CCATATCTACAAAATGAGCTTT 

5901 MGCCCAGGCATGCTGGCTCCAGAGCCTGTGCT CITAGTCATTAAATTATAGTGCCTTAC 
TTGACCTTCCACCCTGGTTACTTTGGATCTC 

TGGMGTTGGCAGAGGGACACTGAGCTGAGCATATTATTGTAC I I I I I AAATGCTCTCCA 

CTGGACAGMGATGGGGGATTTGMTAGAAATTTGGTGAGG 

ACACTCACCTCCTCTTCCTCCCTGGMGA 

[C,T] 

GTGTCTTTGTAMCCACACCCAGGAMTTTGTATATACAMTACATA 
ATCAGGACAGACTTTGACATAAAMGAACTGGGTTTGAGTCCCT I l(_l I AT 

CrGGCTGGCCCTCTGGGAMGTTACTTAACTACATAAAG I I I IGI I I CCATATCTACAAA 
ATGAGGTTTCTCAAMTAGCAGCrAGTTTATAGAG I IGI I GCAAGAATTTAGTAAGCTAA 
TACATATAMTACGTCMCATAGCACCAG^ 

6457 CMCATAGCACCAGXJTACAAAMTATGTGCTCMGAMCTGMGTTACCT 

CTCXATACTATTGACMGGGAAMGTGAAAACAG I I I I IGI I I I ACCATGTGTGTATGTG 

TGTGTGTCTGTGATGTTTCCGACATGCTCTATTTMCA 

TCTCTCTCTTTCTCTTTCTCCCT^ 

AGTTGTGTATATGCAGCATGCCCTGTTTGC^ 

[C,T] 

MTGGMGCCTTGGATTCCTTCTAGCAGATGCA« 
CGGGGAMCACTTGGTCMGMGACACAAMCACTCTCAGAGAC^ 
GCCTTTAGGTAMTATTAGCTMGAAMCTCMGGGGGAMTTGGAGGCA^ 
MTMCGTGGACGCTATTMTGATTATCTTTGACGCrrGMGTCATAT^ 
TTTCTGTTMGATCTCAMGGAGGGTMCAGCMGMGCTCTGA I I I I I CACTGATTCTC 

6632 TTCTCTCTCrCTCTTTCTCTTTCTCCCTCTCT 

CGGCCAGTTGTGTATATGCAGCATGCCCTGTTTGCAG^ 

TATGCCMTGGMGCCTTGGATTCCTTCT AGCAGATGCAGGTTATGATGTATGGATGGGA 

MCAGTCGGGGAMCACTTGGTCMGAAGACACAAMCACTCT 

TTCTGGGCCTTTAGGTAMTATTAGCTMG^ 

[T,A] 
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AAAAAMTMCGTGGACGCrATTMTGATTATCTTTGACGCTTGAACrrCATATAGCT 
TGTAGTTTCTGTTMGATCTCAAAGGAGGGrMCAGCMGMGCTCTGA^ I I I ICACTGA 
TTCTCCGVCMGCAMGTATGGCATTTCMCMGATCA I I I I I ACATCCAATTCTGTGAA 
TTCTATGCATTAAMGTATGTCCAMGAGACAGCT 
ACATTCATTCAGCCAATGTTTACTGAGT^ 

6763 MCCCTTGGATTCCTTCTAGCAGA 

AMCACTTGGTCMGMGACACAAMCACT^ 

TAGGTAMTATTAGCTMGAAMCTCMGGGG^ 

CGTGGACGCTATTMTGATTATCTTTGACGC^ 

GTTAAGATCrCAAAGGAGQGrAACAGCAAGAAGCTCTGA I I I I I CACTGATTCTCCCACA 
[A,G] 

GCAMGTATGGCATTTCAACAAGATCA I I I I IACATCCAATTCTGTGAATTCTATGCATT 

AAMGTATGTCCAMGAGACAGCTCAGGAMTTATC^^ 

GCCAATGTTTACTGAGTGGCTACTGTATGCGCTGTTCTAGGCCCCGM 

GMCAGACAMCTCTGACCrCACAMGCTTATGTTCATTT^ 

ATTGCTCCTGGATTGCCAATCAACTGTGTAMGATGATTTGGA 

695 5 TAATGATTATCTTTGACGCTTGMGTCATATAGCT 

AAGGAGGGTAACAGCAAGAAGCTCTGAI III I CACTGATTCTCCCACAAGCAAAGTATGG 

CATTTCAACAAGATCA I I I I I ACATCCMTTCTGTGMTTCTATGCATTAAAAGTATGTC 

CAMGAGACAGCTCAGGAMTTATCATGACCMTGTGCACATTCATTC^ 

CTG^GTGGCTACTGTATGCGCTGTTCTAGGCCCCGAACATTCAMCA 

C-.T.C] 

TCTGACCTCAaW\GCTTATGTTCATTTTAGTGATMTTTTACMCT 
TTGCCMTCMCTGTGTAAAGATGATTTGGACCAGGA 
GATTGATTTAGAGAMCTGAGATCGCACATAGTACCATTTTCAGG^^ 
GATTTTTAAAACC I IGI I AATGGGCAATGAAGAAGAATC I I I I I I GATATC I IGI I ICI I 
TTMTGGMGAGTTTTCTGCTGTCACCAGAGGACAGGCTGATGCCTGC 

7017 GGAGGGTAACAGCAAGAAGCTCTGA I I I I I CACTGATTCTCCCACAAGCAAAGTATGGCA 

TTTCAACAAGATCA I I I I IACATCCAATTCTGTGMTTCTATGCATTAAAAGTATGTCCA 
MGAGACAGCTCAGGAAATTATCATGACCMTGTGCACATTCATTC^ 
GAGTGGCrACTGTATGCGCTGTTCTAGGCCCCGMCATTCAMCAGGGMC^ 
CTGACCTCACAMGCTTATGTTCATTTTAGTGATMTTTTACMCT 
[T,G] 

GCCMTCMCTGTGTAMGATGATTTGGACCAGGACCTTATTGATTTAGAGAMCT 
TTGATTTAGAGAMCTGAGATCGCACATAGTACCATTTTCAGGAAM 
I I I I IAAAACU IGI I AATGGGCAATGAAGAAGAATC I I I I I I GATATC I IGI I ICI I I I 
MTGGMGAGTTTTCTGCTGTCACCAGAGGACAGGCTGATGCCT I I ICI I 

TCTTCAGGCCTMGCTCCCTGTTGGTTTGTAAACCTGATGCT^ 

7151 GAAATTATCATGACCAATCTGCACA 

TGCGCTGTTCTAGGCCCCGAACATTCAMCAGGGMCAGACAMCTCTGACCT 

CTTATGTTOVTTTrAGTGATMTTTTACAAGTCAT^ 

GTAAAGATGATTTGGACCAGGACCTTATTGATTTAGAGAMC^ 



FIG 3S 



Docket No.: CL001186DIV-II 
Serial No.: (to be assigned) 
Inventors: Gennady V. MERKULOV et al. 
Title: ISOLATED HUMAN LIPASE PROTEINS... 

ACTGAGATCGG^^TAGTACCATTTTCAGGAAAACTCCAATATTAGAI I I I I AAAACCTT 
[G,T] 

TTAATGGGCAATGAAGAAGAATC I I 1 1 1 1 GATATL I IGI I l(_l I I I AATGGAAGAGTTTT 
CrGCTGTCACCAGAGGAG\GGCTGATGCCTGCGATAGAC I I I I CI I I CI ICAGGCCTAAG 
CTCCCTGTTGXTITTGTAAACCTGA^ 

AMCATTCAGTACCCACTGAM&TTTGAGMTAGTGGAGGAATAGM 

TCTGAG 1 1 C I I GGGCAGGGGCAAGCAT^GGAMTATTGMTCATTACTCTTTAGGAGGT 

7308 CTCCTGGATTGCCMTCMCTGTGTAMGATGATTTGG^ 

GAMCTGTGATTGATTTAGAGAMCTGAGATCGCACATAGTACCA^ 
CAATATTAGAI I I I lAAAACCI IGI I AATGGGCAATGAAGAAGAATC I I I I I I GATATCT 
TGI I ICI I I I AATGGAAGAG I I I I CrGCTGTCACCAGAGGACAGGCTGATGCCrGCGATA 
GACI I I I CI I ICI I CAGGCCTAAGCTCCCTGTTGGlTTGTAMCCrGATGCrAGAAG\GA 
[C,G] 

TGTGTATTCCTATTACATTMTAAMCATTCAGTACC 

AGGMTAGAATAGAATGTTATAGTCTGAG I I C I I GGGCAGGGGCAAGCATCAGGAAATAT 
TGMTCATTAGTCTTTAGGAGGTGTCACAACMTTCrCCrA 

ATAGATTTCCTCACATG I ICI I I I MTAMCAGGCTTCTAGCTTATGGMTACCTGATTT 
GACTAAATGTTATATAGGCCC I I I I G I I CCTCCTGTCTGMGMCAAAATACTAGTACTA 

7321 AATCMCTGTGTAMGATGATTTGGACCAGGACCTTATTG^ 
ATTTAGAGAMCTGAGATCGCACATAGTACCATTTTCAG^ 

TTAAAACCI IGI I AATGGGCAATGAAGAAGAATC I I I I I I GATATC I IGI I ICI I I IAAT 
GGMGAGTTTTCTGCTGTOkGIAG^GG I I I ICI I I CT 

T(ZAGGCCTMGXTCCCrGTTGX7TITGTAAACCTGATGCTA 
[T,C] 

TACATTMTAAMCATTCAGTACCCACTGAMGTTTGAGMTA 
AATGTTATAGTCTGAG I ICI I GGGCAGGGGCMGCATCAGGAMTATTGMTCATTAGTC 
TTTAGGAGGTGTCACMCMTTCTCCTAT^ 

CATGI ICI I I I MTAMCAGGCTTCTAGCTTATGGAATACCTGATTTGACT 
ATAGGCCC I I I IGI I CCTCCrGTCTGMGMCAAAATACrAGTACTATGGAATATTGGTA 

7542 GCGATAGAC I I I ICI I ICI I CAGGCCTMGCTCCCTGTTGGTTTGTAMCCTGATGCTAG 

AACAGACrGTGTATTCCrATTACATTMTAAMCATTCAGTACCCACTG 
ATAGTGGAGGAATAGAATAGAATGTTATAGTCTGAG I ICI I GGGCAGGGGCAAGCATCAG 
GAMTATTGAATCATTAGTCTTTAGGAGGTCT 

CCAATCTATAGATTTCCTCACATGI ICI I I I MTAMOXGGCTTCTAGCTTATGGAATAC 
[C,T] 

TGATTTGACTAAATGTTATATAGGCCC 1 1 I IGI I CCTCCTGTCTGAAGAACAAAATACT A 
GTACWGGAATATTGGTATATATTAMWAW 

(TACTACTMCAACATOTACTGAGCACCCACTGGCAGCCA I ICI I ICATACT 

ATTAMCCCCGTTAGO^GCCCCGTAMCCAGXTrACTACCCTGTTTATTTCC 

AMCATAGGCT(IAGAGCATTTCACTM 

8597 ATAAMCTGGTCAGGAGAMTTGTATTT(^TTGGACATTCACT^ 

TGTTTATGAGGGT CACTGTTAGGTGTG 1 1 I I I GAGGGTCAGTTTTCTCAGAGTCTTACAG 
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GAGrrCACCnTATGrrQGMTAAMCMCTGTTACTT ATAGTGCCCTCAATTCCCTGTC 

CTOOTGGGMTMCCCTACnACT^ 

TGTAGGGCAMCCTTTCCTGGGTCTCTGGTCACAG 

[T,C] 

TTCCCAGGMTMCATGTGTTCCAMTTCAMGAMTAAT^^ 
ATTCCCTOX^GCTGAAAAAGTAAMTTCMTGCCATGGAATATGGCr 
ATGTGCATCMTCATCTGTTCTCACAACCGWVTGGGA I I I I lAAAAAATAAAAGGGAA 
GGGCTT ATACCTATATTTAMCAMTTG^AMGGCATGGTTATA I I IGI I IGTGAGTTGG 
MCACACMGCITACTATMTAAATCAATTGAGCTTATCT^ 

8803 TMGTAGCTGTGAGCCTGCAGTGCACAG^ 
TGGTCACAGCAGCATATTGACTACGCT 
ATTCAMGAMTMTTCCACAGAGTMGTTTCTAGA^ 
ATTCMTGCCATGGMTATGGCTGAMCATMTAAAT^ 

CAACCCAAATGGGAI I I I I AAAAMTAAMGGGMGGGCTTATACCTATATTTAMCAAA 
[C,T] 

TGAAAAGGCATGGTTATA I I IGI I I GTGAGTTGGMCACACMGCTTACTATMTAAATC 
MTTGAGCrrATCrATTG^GrGrGTGATTTAGTATTTATGAAATAGCMCTAAATGTAAG 
CACTATGTAGAAATTTCTAAAG I I I I I lAAGCTGACAACTTACI I C_ I I AATTTACTTACT 
TTACTTMTTTACTTTACMTTTACrrTCCAGGTATTTTGGAAAGAM 
TTCCMGTAAMGTTGAAAGGMCCCACACTMTAAMGCTTTGMTTTGT 

9016 AMTGTGC^TCMTCATCTCrTTCTCACAACCCAAATGGGA I I I I I AAAAAATAAAAGGG 

MGGGCTTATACCrATATTTAMCAMTTGAAMGGCATGGTTATAI I IGI I IGTGAGTT 
GGMCACACMGCTTACTATMTAMTCMTTGAGCrrATCTATTCACT 
TATTTATGAMTAGCMGTAMTGTMGCACTATGTAGAAATTTCTAAAG I I I I I IAAGC 
TGACAACTTAC I I LI I AATTTACTTACTTTACTTMTTTACTTTACAATTTACTTTCCAG 
[G,A] 

TATTTTGGAMGAAATCAATMTCTAGTTCCAAGTAAAAGTTGAMGGMCCCACACTM 
TAAAAGCTTTGMTTTGTCATTGAACrTCCACTAAAGTTTCCAATTTTA^ 
CATGTGAMGTGCAATATTTCAGTTTAGGGAMTATTTTCATTATCACCACT^ 
TMCAMCATATATTCATTAGTATTTTAGATTGACAGGC^ 

CAGTTAGCATCAGTCAGCATATACTAAAAMGTATCAAAGMCTCATAGGAGATCAAAAA 

9967 GTTTCATTTAGGACATAAATA I I I I I AGTGACTG I IGI I I GCATnTGGACAGAGCAATT 

TCTGTTATGTMGGAGC^CCCACTCTTTGTAGGACATTTAGT AGGTCCCAGCCCATTAAA 
CAGGGCTCTGCAGTCAGCGTGACCCT^ 

TCTGGGGAAGTACTATTCCTGATTCAGAGTL I I I I I ATCAATTGTTCAGTCAATTATTTC 
AGTTCTTCTTTI I CTGGCCMGACAGTTTTMTGTTCCMCMGTGTTTCAGTACACACA 
[T,C] 

ACACACACACACACACACACACACACACACACACACATGCrAGTGGAGGCCCAGGAAGGG 
ACCTCTGGAMCCAMTTATATGGATATTCTCCCTAGCCTACCCAGTGTTGTGCTAATCT 
CCATCCTCACAGATATACAMGGGGTGCMTGCTACTG^ 

GATGCCTGGTCCTTACTGGGCCATCGTGGATGCTAGGGAAAGCCCLI I ID I I I IGGAAA 
CAGGGMGAGTCTAGAGGGTTGAAAMCACCCAGTMGACACrGGGAGCAGTGAAATT^ 
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10008 CATTTTGGACAGAGCMTTTCT 

TAGCTCCCAGCCCATTAMCAGGGCTCT 

CACACATTTCCAMCACCCTCrGGGGAAGTACTATTCCTGATTCAGAGTC. 1 1 1 1 IATCAA 
TTGTTCAGTCAATTATTTCAG I I (J I LI I I I I CTGGCCAAGACAG I I I I AATGTTCCAAC 
MGTGrrTCAGTA<^CACATA<^<^^ 
[C,T] 

AGTGGAGGCCCAGGMGGGACCTCTGGAMCCAMTTATATGGATATTCTCCCTAGCCTA 
CCCAGTGTTGTGCTMTCTCCATCCT^ 

AMGAGCAAAGCAMTGGAGATGCCTGGTCCTTACTGGGCCATCGTGGATGCTAGGGAAA 
GCCCCI I I CI I I I IGGAAACAGGGMGAGTCTAGAGGGTTGAAAAACACCCAGTAAGACA 
CrGGGAGCAGTGAMTTTCATTC^TAGTGAGAMGAAMCCTGTTAGMTM 

10363 AGCCTACCCAGTGTTGTGCTMTCTCC^^ 

CTGCTGAAAGAGCAMGCAMTGGAGATGCCTGGTCCTTACTGGGCCATC 
GGGAAAGCCCC I I I C_ I I I I I GGAMCAGGGMGAGTCTAGAGGGTTGAAAAACACCCAGT 
MGACACTGGGAGCAGTGAMTTTCATTCCATAGTGAGAMGAAMCCT 
TGGGTGATGCTGCAGAMGAMTCMTTCACCTCCTGTGACTGATTATTTGOT 
[G,A] 

CTCTGTGATTCATTCTGGCATCTCAGAGTTAGGGATGAMTGAGMTOT 

ACCCCATGCXTCGGAAGTTTACACAG^ 

CCCCTGCCAACTACTCCATTTCC^ 

AAMTTGGAGACTTGAGAGCAGAGAAGACTGMGGCAGATTATCrTTATAGM 
GMGACTTCCMTTCATCCCCAGTATG^ 

10684 TCTCAGAGTTAGGGATGAMTGAGMTGTTGCC 

ACACAGCAGTAGCTACrCCAGOVGCrTMCCATCACCTTTCCCCT 

TCCCCCAATCAAGTCAMCTGTCCA^ 

AGAGMGACTGMGGCAGATTATCTTTATAGA^ 

CAGTATGATCACGATAGMGGAAAAAATGACTAAGCAGAGCCCCAA I I I IGI I AGAAACA 
[T,C] 

TGCGTAAGTATTTA I I I I I AG\AGATTGTCTTATCTCCTGTTCTCTCAGGGTTTGTAGCC 
TTTTCCACCATGCCTGMCTGGCACAAAGMTCAAAATGMTTT^ 
ATCTCATTCAAATATCCCACGGGCA I I I I IACCAGGI I I I I I CTACTTCCAAATTCCATA 
ATCAAGGTAGGCTCCTTTCMGWVVTGTACCTGAGGATCTG^ 
TTATTATTTTOWVTCTACTGTA^ 

11177 TCCTTTCMCAAAATGTACCTGAGGATCTCATTTTGGA^ 
MTCTACTGTAMGTAAAAGTAGGAMTTTAOT 
GGTATGTGCrTGTGTATGTGTGTCCCTGCGTGTGCGCATCT 
GGTTCTGTMTACMTTTACTATACMGGTCATCAG<^GGCrGAGTATATCT 
CTAGCTGAACTGAGTGCTATATGACAACAAGGAI I I I I C_ I IGI I I ICCCAAGTGI I I I I I 
[G,T] 

TTCCATTTAGTCAGGTAGGTCMTGMTTCACATTGCCCAMTG 

ACCCATMTCACTGATGTGTCGAAT^ 

CCMTATGGAMCTTGCCCTMTMCTA^ 

GCTCAAGTATTAATTCAA^TATTTATTGGTTA 1 1 1 I I CAGGAGTTGAAAAAGTCATTTGG 
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TTGCCMTTGTGGATTTGGGATTTTATCTATTAAAGGG I I I 1 1 1 1 1 1 1 1 1 1 1 CTCTTTGC 

12345 TTTAAGTCCCATATCCTGCTC 1 1 I ICI I CCGTCAGTTTCCCCCAGAAGCTCCAAGACCCC 
ACCAGGMTCCCCATCCMGTTTACTTTCC^ 
TTGTGACATTATCATATCTTTTCT^ 

ACriTTCAGCCTGAGAGCrGGCrMTCrGGGACACTACTCGMTGGVGr 

TMCATGGAAMCCCCGATTTTCCCTTATATTG\AGGTATTATTTGA 

[T,C] 

GTTTTACATTTCATACCMTTMTGAGAA 

TACAGGGMGCTTCACTATGGAGMGTGMTTTGGGATTGAGGGCCrr^ 
CTTGTAMTMTATTTGATACTCTTCCT^ 

TGAATMTTTGGTCTCCTTGACTGAATCAGrAAGT ACAAATAGATCCCCAAGCATGGCTC 
TTTCCTAGMTGAMGAMTGTCAAGMGTC^^ I I I I I I 

12349 AGTCCCATATCCTGCTC I I I ICI I CCGTGVGTTTCCCCCAGAAGCTCCAAGACCCCACG\ 
GGMTCCCCATCCAAGnTTTACTTTCCCMCrCCrGGMGTTTCM 
GACATTATCATATCTTTTCTG1TCM 
TTCAGCCTGAGAGCTGGCTMTCTG^ 

ATGGAAMCCCCGATTTTCCCTTATATTCMGGTATTATTTGACCTTMGA 
[C,T] 

TACATTTCATACCMTTMTGAGAAAAAMTATTGGCAAGCACrGACT 
GGGMGCTrCACTATGGAGMGTGAATTTGGGATTGA 
TAMTMTATTTGATACTCTTCCTCA^ 
TMTTTGGTCTCCTTGACTGMTCAGTMGTAC 

CrAGMTGAMGAMTGTCMGMGTCrGMGATGATTCrTGMTTTTGG I I I I I IGCTA 

13115 TAGAAGATMGAAMCGMGATAGCITCTACCAAMTCT 

TGATATGTAGCGMTTTATGTCCTTATGGGCTGGATCCMCMGAAAMTATGM 

TATGTATGATMTTATAGGGCCATTTGATAC 

ATTTTGATATATCTATTTACTGTATAAATC 

| | | | I | | | I GCrTTTAAAMTGTTTATGGGTATATAATAGTTGTACATATTTATGAGACA 
[C,T] 

ATATATTTTGATATMGCATACMTGTGTMTGACCAMTCAG^ 
TCACCTCAAGCATTTATCA I I ICI I I I IGI I AGAGACATTCTMTTTGACTCTTCTAGTT 
ATTTTGAMTATACMTGMTTATTGTTMCrATAGTCATCCrATTCT 
TTAGTCCrrCTMCGGTATTTTGGTACCCATTAACCAATGCCTCTTTA 
CCCTACTACCTTTCCCAGCCTCT^^ 

133 54 Al I I I I I I I I GC I I I lAAAMTGTTTATGGGTATATAATAGTTGTACATATTTATGAGAC 
ACATATATTTTGATATMGCATACMTGTGTMT^ 

CATCACCTCAAGCATTTATCAI I ICI I I I IGI I AGAGACATTCTAATTTGACTCTTCTAG 

TTATTTTGAMTATACAATGAATTATTGTTMCTATAGTCA^ 

CTTTAGTCCTTCTMCGGTATTTTGGTACCCATTMCCMTGCCT 

[T,A] 

CCCCTACTACCTTTCCCAGCCTCTGGTMCCATCATTCrrCTCACTATCTCTA 

AG II I I I I I I I AAACTCCCCTATATGAGTGAGAACATGCAGTATTTGTC 1 1 I I IGTGCCT 
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GGCrTATTTCACTTMTCTMTGTT^ 

TTTCATTU I CI I ATGGCTGTCTATATCTACC^CATTTTATTTATCCACrCATCTGTTGAi 
TGGACACTTAGGCTGATTTCATATCT^^ 

13373 MTGriTATGGGrATATMTAGn"GrACATATTTATGAGACA(^TATATTTTGATATAAG 
(ZATACMTGTGTMTGACOWVTCAGGGTMTTGGGA^ 

CATTTCTTTTTG1 I AGAGACATTCTMTTTGACTCTTCTAGrrATTTTGAAATATACAAT 

GMTTATTGTTMCTATAGTCATCCTATTGT^ 

ATTTTGGTACCCATTMCCAATGCCTCTTTATCGTCCCCCACC^ 

[C,G] 

CCTCrGGTMCCATCATTCTTCTCACTATCTCTATAAGGTCAGI I I I I I I I IAAACTCCC 
CTATATGAGTGAGAACATGCAGTATTTGTC I I I I I GTGCCTGGCTTATTTCACTTAATGT 
MTGTTCTCTMTTTCATCCACATTAT^ I I C I I ATGGCT 

GTCTATATGTACCACATTTTATTTATCCACTCATCTGTTGATGGACACT^ 
CATATCTTGGTCATTGTGMTAGTGCTGTACTAM^TGGGGGT 

14677 AGAGATAGAGATCTMTTTCATTCTTCTGCATATGGATATCTAGI I I I CCCAGCATCATT 
TCTTGTGGAAATTGTCCTTTGCCCAATGTATG I I CI IGATGCCI I IGI IGAAAATTAGTT 
' . GACTATAAATGTGTGGATTTATTTGTGGG I I CI I I ATTCTGTTCCATTGGTCTATGTGTC 
TGI I I I I ATGCCAGTATCATGCAG I I I I GATTATTACAGGTTTGTAGTATAATTTGAAGT 
CAGGTCATGTGATGCCTCCAGC I I IGI ICI I I I I I CTCAGAATCTTATATTTAGAAAAAC 
[C,G] 

TAMGACTCCMCAAAAMCCTGCTAGMCTGATA^ 

ACMCATCMCATACAAAATTCAGCAG^TTTGAATATGCCMGAGCAAATM 

AAAMGAMGAAAAAAAMCMGAAATMTCCCATTTATMTAGCT 

ACACCTAGGMTAMCCATACCAMGMGTGAMGATTTCTACMTGAAMCTATAAM 

ACTGATGAMGAAATTGAAMTGACATTAAAAMTGGAAAGGTATTCCATOT 

14734 Al I ICI IGTGGAAATTGTCCTTTGCCCAATGTATGI ICI IGATGCCI I IGI IGAAAATTA 
GTTGACTATAAATGTGTGGATTTATTTGTGGG I ICI I I ATTCTGTTCCATTGGTCTATGT 
GTCTGI I I I I ATGCCAGTATCATGCAGTTTTGATTATTACAGGTTTGTACT 
AGTCAGGTCATGTGATGCCTCCAGCI I IGI ICI I I I I I CTCAGAATCTTATATTTAGAAA 
MCGTAMGACTCCMCAAAAMCCTGCTA^ 
[G,A] 

GATACAACATCMCATACAAMTTCAGCAGCA 

AAAAAAMGAMGAAAAAAAMCAAGAMTMTCCCATTTATMTAGCTACAMTAA 
AAMCACCTAGGMTAMCCATACCAMGMGTGAMGATTTCTACMTGA 
MCACTGATGAMGAMTTGAAMTGACATTAAAAMTGGAAAGGT ATTCCATGTTCATG 
GATTGCMGMTCMTATTGTTAAMTGTCCATATGATCCAAMCMTCTACA 

14747 ATTGTCCTTTGCCCAATGTATG I ICI IGATGCCI I IGI I GAAAATTAGTTGACTATAAAT 
GTGTGGATTTATTTGTGGG I ICI I I ATTCTGTTCCATTGGTCTATGTGTCTG I I I I IATG 
CCAGTATCATGCAGTTTTGATTATTACAGGTTTCT 

GATGCCTCCAGC I I IGI ICI I I I I I CTCAGMTCTTATATTTAGAAAAACGTAAAGACTC 

CMCAAAAMCCTGCTAGMCTGATAMCAMTTCATTAMTTTGCAG 

[A,G] 
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CATACAAMTTCAGCAGCATTTCMTATG^ 
AAAAAAAMCMQWVTMTCCCATTTATMTAG 

ATAMCCATACCAMGAAGTGAMGATTTCTACMTGAAMCTATAAAACACT 

GAAATTGAAAATGACATTAAAAMTGGAAAGGTATTCCA^ 

MTATTGTTAAAATGTCCATATGATCC^ 

14808 TGTGGATTTATTTGTGGG 1 1 LI I lATTCTGTTCCATTGGTCTATGTGTCTtal I I I IATGC 
CAGTATCATGCAGTTTTGATTATTACAGGTTTGTAGTATMTT^ 
ATGCCTCCAGCI I IGI I CI II I I ICTCAGMTCTTATATTTAGAAAMCGTAAAGACTCC 
MCAAAAMCCTGCTAGMCTGATAMCAMTTC^^ 
CATACAAAATTCAGCAGCATTTCMTATGCCMGAGC^ 
[-,A] 

AAAAAAMCMGAMTMTCC^TTTATMTAGCTAG^TAAMTAAMCACCTAGGAA 

TAAACCATACCAMGAAGTGAMGATTTCTACMTGAAMCrATAAMC^ 

AMTTGAAMTGACATTAAAAMTGGAMGGTATTC^ 

ATATTGTTAAMTGTCCATATGATCCA/^ 

AMTACCMTGACATTCTTCATTGAMTAAAAAAAMGCCTA^ 

15086 AATAATGTAAAAAAAAGAMGAAAAAAAAACMGAMTAATCCCATTT 

AMTAAMTAAMCACCTAGGMTAMCCATACCAMGMGTGAMGATTTCT 

AMCTATAAMCACTCyVTGAAAGAMTTGAAM^ 

ATGTTCATGGATTGC^GMTCMTATTGTTAAMTGTCC^ 

CAGATTCMTGCMTCCCTATCAAAATAC(^^ 

[-,A,G] 

CCTAAMTTTMGTGGAACCATGAAGGTAGATGT^ 

CAAOWVCCTTGAATATGAAGACT^ CTTCTATTCCC 
TGGTGAMTTTAGGAGAATGGATGTTTTATAATGGGTAGCAG I I I LI I ACATGTTCTCAA 
TCAGGIATMCTTACTACAGTCMTTT^ 
ATAAMTCCTAAAAAAGGAGAGAAGCACATATAMCCTGCGTCTT 

15414 TAGATGTCTGCTATACATAGAAGATTAAGTACTCMCAAACCTT 
GMGTGMTAGGCAGCTTCACTCTTCTATTCCCTGCT 

TATAATGGGTAGCAGI I I LI I A^TGrrCTCMTCAGCCATMCTTALTACAGTCMTTT 
GMTTTATTGCATTTGAATATATTGGATTA^ 

CATATAMCCTGCGTCTTATTTCATGTGTTCL I I I LI I I GTGGGTGAL I I I IGI I I IGAA 
[A,G] 

TAAMCCTGCAAMTM(^GGACAGGGTGGMGGGAGATGGGATCCCCT 
AGCAGCAGTCCTGI I I I AraCCrCTTCATTTTCTGTTATTGAGMTTCMGMGMGGA 
GG^GGMGAGTTCACATCCACAGACTGGTGTGGTTGMTAGTTCT 
MTAGCAGCCMTGAGGCTGTTACAGTGMGCCAGTCCCMGATMTTGTT 
TATTCTCTMGMGCTAAATTGTGTTAGACrGAMCCCATMGGMCC^ 

15722 TGCAAAATMCAGGACAGGGTGGMGGGAGATGGGATCCCCTCTTTAT 
GTCCTGTTTTATCACCTCTTCATTTTCTGTTA^ 
GAGTTCACATCCACAGACTGGTGrGGTTGMTAGTTGTCT^ 
GCCMTGAGGCTGTTACAGrGMGCCAGTCCCMGATAATTGTTCTGTACCC 
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TMGAAGCTAMTTGTGlTAGACTGAMCCCATMGGMC(^TTGrrC^GTTGGCTTG 
[T,C] 

TCAAAAGTAAAGA I I I I I MTAGTTTCTCTTMTTAGATTATTTTCTAAGACATAGAATT 

ATGATTACrATTTTATCTCTATMTTTTCATCTCT 

CCTTTGGAAAAMTTGGCTTTTAGCTTTACTTTTGC^ 

AGCCTAGGAMTTGGTACTATGACTTTTACTA^ 

ACTCAMGATGTTAMTATGGTGGCG\AGTTCACAMGCT 

15861 GGTGTGGTTGAATAGTTGTCrcrACTGTATr^^ 

TGA^GCGVGTCCCMGATMTTGTTCTGTACCCCTATTCTCrMGMGCTA 

AGACrGAAACCCATAAGGAACCATTGTTCAAAGTTGGLI IGI ICAAAAGTAAAGAI I I I I 

MTAGTnTCJCTTMTTAGATTATTTTCTMGACATAGM 

CTATMTTTTCATCTCTATMCGTTTACAAATACTGAMTMCCT^ 

[T,C] 

TTTAGCTTTAL I I I I GCMTATTTTATTTTATCCCCATAAMGCCTAGGAMTTGGTACT 

ATGACTTTTAGTATGTTCATTTMTAGATGAAMCACA 

GCTGGCCMCTTCACAMGCTGAT^ 

GATTTMTCTGTGACAGTGCACCTGGGTGCGCA^ 

TAGAACCTTTCCTAGTTGGCTTTGCrCC^^ 

16264 CTCAMGATGTTAMTATGGTGGCCMGTTCACAMGCTGATC^ 

CTGAACTCCTGG I I I ICTGATTTAATCTGTGACAGTGCACCTGGGTGCGCATGCATGCAT 

CACCCCCAG^Cn"GCACATAGMCCTTTCCrAGnrTGGCrTTGCTCCATGATGACCATTAC 

TGTTCCTTCTACTTCAAMTAAGCAMTTATCCTACAGATTCAGAGCrGGT^ 

CTGTCMGCAGCCCATTCCATTAGTCAGCrTGTGGTTCACTCACATTAMGTATTGACCr 

[A,T] 

AATGGTATATTTATCTAGATAATTCTACC I IGI I ATTTTCAAAGCCCCAGTC I IGI I IGC 
TAATTCTGTGCATCA I I I I I CTCTGATTCTGAAAGGCAAAA I I I IGI I GGGCAATTGCTG 
TAATATGAG I I I I ATCTCCTTTAGAGTCGAATGGATGTGTATATGTCACATGCTCCCACT 
GGTTCATCAGTA^CAACATTCTGCATATAAMCAGGTAGAGTCTTAGTCATGGAAAACC 
ATTCCAATCCITATTTTCAATATAT^ 

16314 ACAACAGGGCCTGAACTCCTGCj I I I ICTGATTTAATCTGTGACAGTGCACCTGGGTGCGC 
ATGCATGCATCACCCCCACACTTGCACATAG^ACCTTTCCTAGTrGGCm 
TGACCATTACTGTITCCTTCTACrrCAAMTM 
TACAGGTGTGCTGTCMGCAGCCCATTCCATTAGTCAGCTTCT 
GTATTGACCTAAATGGTATATTTATCTAGATAATTCTACCI IGI I ATTTTCAAAGCCCCA 
[G,A] 

TGI IGI I I GCTAATTCTGTGCATCA I I I I I CTCTGATTCTGAAAGGCAAAA I 1 1 IGI IGG 

GCMTTGCTCTMTATGAGTTTTATCTCCrTTAGAGTCGMTGGATGTGTATATCT 

TGCTCCCACTGGTTCATCAGTACIACM(IATTCrGC^^ 

ATGGAAMC(IATTCCAATCCTTATTTTCMTATATTTAAAM 

MCAGGCCTACCCTMGAATCTTAAGAGCTTGCTTCCAGTTTGTCC^ 

16877 TAAGAGCTTGCTTCCAGTTTGTCCTTGCTGCCTTCTCT^ 

TMGAGAMGGATGTTATGGTACAGACCMCTAGATGACATAMTGMCACCACO 
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TCAGAGI I I lAAAMTAGGCCCTGMCTGMGCAAGAGGTAAACrAGGGAAGCCrCAGGA 
GMCTGAGACTTCrCCAGAGAGAAGTATCTGGGATTTAACI I CI 1 1 CTAATGAGGCTTGG 
7TTTCCATGAACTTTTCCTTTAMCCM 
[A,G] 

TTTCTCATMTTCTAAMTGGCTGGTTACATCC^ 

CTCCTAGCATACAGCA I I I I I CTAAMTTTGCrGTTAGCTTTCATGATTCrrACCCTAAC 
TATTCI I I I ICTAAAAAACAI I IGI I I CACKTTTACCACTCTGATGAATTCAGAGCTTAT 
GACrGGGGAMTGACGCTGATMTATGAMGVTTACMT(^GGTGAGCTATTTA(^GrAA 
CCCCAGCATGCTGATTTTGATAMTTATM 

16966 AGTAGATGACATAAATGAACACCACCTTAAATCAGAGI I I I AAAAATAGGCCCTGAACTG 
MGCMGAGGTAMCTAGGGMGCCTCAGGAGMCTGAGACTTCT 
TGGGATTTAAC I I CI I I CTAATGAGGCTTGG I I I I CCATGAAC I I I I CCTTTAAACCAAG 
GGGGGTATTGCTCATCTTTCTGTTGAGCCCCAT^ 

CATCCTTCTGCTGATCTAGGAGCCCTATTTTCCTCCTAGCATAC^GCA I I I I ICTAAAAT 
[T,G] 

TGCTGTTAGCTTTCATGATTCTTACCCTAACTATTCI I I I ICTAAAAAACAI I IGI I ICA 

GCTTTACCACTCTGATGAATTCAGAGCTTATGACT 

ACATTACMTC^GGTGAGCTATTTACAGTM 

ATAAAAMTTATTTGAGGGTGGAAAGACTCCTACCTGT 

TAGAACI I I I I I I lAAAAAMTTTTMTTTTMTTTTMTTTATTTC^ 

17147 GGGGTATTGCTCATCTTTCTGTTGAGCCCCATTTCT 

ATCCTTCTOTGATCTAGGAGCCCTATTTTCCTCCTAGCATACAGCA I I I I ICTAAAATT 

TGCTGTTAGCTTTCATGATTCTTACCCTMCTATTCI I I I ICTAAAAAACAI I IGI I ICA 

GCTTTACCACTCTGATGAATTCAGAGCTTATGACTGGGGAMTGACGCTGATM 

ACATTACMTCAGGTG^GCTATTTACAGTMCCCCAGCATGCT 

[A,G] 

TAAAAAATTATTTGAGGGTGGAMG^CTCCTACCTGTCATT^ 
AGAACI I I I I I I I AAAAAMTTTTMTTTTMTTTTAATTTATTTCAGA 
TTAMGAAG(IATATACAAAGAAACTTACATCATCTGTMTCC^ 
AGATCTACTAACATTTTGCTCTATTTATTCCMTTTTCTCACT 
CMCTTTTMTCTTTCTATTTTACTTAAGCTATA 

17219 ATCTAGGAGCCCTATTTTCGTCCTAGCATACAGCA I I I I I CTAAAATTTGCTGTTAGCTT 
TCATGATTCTTACCCTAACTATTC I I I I ICTAAAAAACAI I IGI I I CAGCTTTACCACTC 
TGATGMTTCAGAGCTTATGACTGGGGAMTGACGCTGATM 
GCTGAGCTATTTACACTAACCCCAGCATCCT"GATTTTGATAAATTATMTAAA 
TTGAGGCTGGAMGACTCCTACCTCTCATTTGCTGGCATTTATACTGAT^ I I I I I I 
[T,C] 

TAAAAAAATTTTMTTTTMTTTTMT^ 

ATACAMGAM(TTACATCATGTGTMTCCTTCCATCCAGAGATMCTAC^TCTACT 
ATTTTGGTGTATTTATTC(^TTTTCTCACTATTATATTGCI I I I AGACAACTTTTAATC 
TTTCTATTTTACTTMGCTATAGTMGAGATMCTMTATMCTGAGGGA I I I I IAAATG 
CAI I I I I MTC^C<TAC^TMTAGAMTTATTTCATAAAMTCTTTACAC^TAMTGAAT 
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18628 AAMTGAMCWATCMCACGG^^ 

GAGAGTGTTMTGTCCTTAGAATTTGGCCAC^^ 

GGTCCTATTTTGTGMTTAATCTCATTTGATGCCAA I 1 1 1 1 ATTACATTCTCTCCAAAAA 
ACTAGTCTCMCAGTTTGXTCTCTCCrC^ 

TTTTATTGACTATMGAGAATTMCCCATGrMGCrCCATGAGGGTAGGGATTTCT 
[A,G] 

I I I Id I I CACCAGTGrrTTCrCATCTTGAAGAGrACATGACMTTACTGGGCTCCGVGTA 
TCTATGTGTTGCATTAATGAAA I I I CI I MCTTTMTCTACCTCAAMTGrcrCTATCTT 
CITGATTCTCTCCTTCCTrTCrCTATCAGAAM 
TCCGGTCCrGTGCCCrrGATCCCATCrCTTCrCACTTCCCClTCCTTCCT 
TCCTGTCCCTTATGAAAMCAAGCMGACCATCMTTCTATCM 

1865 5 TCAAGATCATTATGGTCMGTACTAMGTATGTGAGAGTGTTAATGTCCTTAGMTT^ 
CCACAGTTAGCTGGTCCTACTCTGCTCC^ 

TGATGCCAAI I I I I ATTACATTCTCTCCAAAAMCTAGTCrCMCAGTTTGCTCrCTCCT 
CMGirCACAGCATTATCTCTGCTATATCTATATTTTAT^ 

ATGTAAGCTCCATGAGGGTAGGGATTTCTCATCGI I I IGI ICACCAGTGI I I ICTCATCT 
[T,G] 

GAAGAGTACATGACAATTACTGGGCTCCCAGTATCTATGTGTTGCATTMTGAM 
TMCTTTMTOACCTCAAMTGTCTCTATC. I I LI I G^TTCTCTCCTTCCTTTCTCTATC 
AGAAMTGATGGTCCTCrTATTTTCCAAGTTATTCCGGTCCT 
CTTCTCACTTCCCCTTCCn~CCTG^ 

ACC^TCAATTCTATCAAGTTATCATTATGTCACTCTG I ICI I ATCAACATA I I I I I ACTA 

18984 CAGTATCTATGTGTTGCATTAATGAAA I I IGI I MCTTTAATCTACCTCAAAATGTCTCT 
ATCTTCTI GATTCTCTCCTrCCTTTCrcrATCAG7W\ATGATGX7TCCTCrrATTTTCCAA 
GTTATTCCGX^CCTGTGCCCTTGATCCCATCTCTTCTCACrrCCCCTTCCTTCCTGCCTC 
CATTCTCCTGTCCCTTATGAAAMCMGCAAGACCATCAAT^ 

GTCACTCTG I IGI I ATCAACATA I I I I I AGTATTGAAGAGGGG I IGI I CTACTTACTCCT 
[G,T] 

MCCTTGTACAATCTAGTTTAGGTCTTCATG I I I I I ATCATAGCTACCTTATTTAAAGTC 

ACCCATGGCTTTTMTTGCCAMTTCMTGGCCTATCr^ 

TTCGTTACCACAGTCTCCTTGAMCTCAGTCCCCTGACT^ 

TTTCTGATTTTCC1TCTGTTTGTGATTGTTCGI I I I GTCCCAGGCACTGGCTACTCCACC 
TTCCACCTCTCTGAMTCATTAGCATTC I ITCTTCCT 

19407 CGTTACCA(^GTCTCCTTGAMCrCAGTCCCCTGACTTGG^ 

TCTGATTTTCCTTCTGTTTGTGATTGTTCG I I I I CTCCGaG^CACTGGGTACTCCACCTT 
CCACCTCTCTGAMTCATTAGCATTCCCCMGGATTCTTCA I I ICI ICCTTG 

GAGMGTCAGCATAGCTTTMTTTGGACCATTTCTAT I I I I ICAGGA 

CTTGXICTTCMCCTATTC1TTCTCT 
[C,T] 

GMGACAGACCTCCGAGAMTGACCCTTGTCTCCAAAACTTCCGCMW 
CCTAGCCTGACATTCAGACTTTGATT^^ 
TTATATATTCTGTTCTCCAGGTACACT 
ACTCTTCCTGCCTCCCACTCACCCT 
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GTG\TTT(^(^GGACCCCrCTTTCrATG^GCCCTCAGGTG(WV\TM 1 1 1 1 1 IGCCTTT 

19531 CrCTCTGAMTC^TTAGCATTCCCCMGGATTCTTCAAMCTCTL I MCI ICCTTGGAGA 
ACTC^GC^TAGCTTTMTTTGGACCATTTCTATGGCTTATCTAGA I I 1 1 1 ICAGGACTTG 
CCTTCAACCTATTCrrTCTGTAGGrGATTCCA^ 
GACAGACCrCCGAGAMTGACCCrTGTCrCCAAMCTTCCGCAATATCTCCA 
AGCCTGACATTCAGACITTGATTATCTGCCrCCMGTTTATATCCT 
[T,C] 

ATATTCTGTTCTCC^GGTACACT^ 

TTCCTGCCTCCCACTCACCCrCATCTCTGCTGTO^AAATGCM 
TTTCACAGGACCCCTCTTTCTATGMGCCCTCAGGTGGAAATAA I I I I I IGCCI I I I I I I 
CCATTTTATTTTTGGAGTGTTTATGGCATTTM 

GCCTTGCrCCCTCTTTTGCAAA I I I CI I AAAGGTAGAGACCATTGTATG I I I I C_ I ICATA 

19911 CTCATCTCTGCTGTCAAMTGCAACCTTCCCTCA^ 

CTATGAAGCCCTCAGGTGGAAATAAI I I I I IGCCI I I I I I I CCATTTTA I I I I IGGAGTG 
TTTATGGCATTTMCATACCmCTTTGTATAO^ 

AAA I I I CI I AAAGGTAGAGACCATTGTATG I I I ILI I CATATGTTGCTGGTGCCTAACAG 

AACTATGGCCATTGTCCACATTCA^ 

[C,T] 

CTCTCATGMTGCCCTTGCTTTCTCTCCCACAGAGTC^ 

GCG^TGAMGTGCCTACTGCTATTTGGGCTGGTGGACATGATCT 

GATGTGGCCAGGATACTCCCTCAMTCMGAGTCTTCATTACT 

TGGMCCACTTTGATTTTGTCTGGGGCCTCGATGCCCCTCAACG 

ATAGCTTTMTGMGGCATATTCCTAMTGCMTGCATTT^ 

20199 TTTGAGGAGOTCCTCrCATGMTGCCCTTGCrrTCTCTCC 

ATATGACCTGACTGCCATGAMGTGCCTACTGCrATTTGGGCT 

CGTMCACCCCAGGATGTGGCCAGGATACT^ 

GCTATTGCCAGATTGGMCCACTTTGATTTTGTCTGGGGCCTC 

GTACAGTGAMTCATAGCTTTAATGAAGGCATATTCCTAAATGCMTG I MIC 

[A,G] 

ATTAAMGTTGCrrCCAAGCCCATMGGGACTTTAGAAAAMTGGT 
TTGTCCCCCAGCACCCTGGGGG^GATGCACAGTGGAGTCTGTTTTCCMCT 
TAGTGTTATTTATGTTTAGAGAC^TCTTTGCATGGGACC^TCT 
ATGAGGTAGATTAGGCAAAAAGATAAACMGTTG^ 

TTAAATTGTAA I I I I lAGGGCATACCATGVVGrATAGAMTGTCTGMGCTTCAAAGGAA 

20243 AGAGTCATCCCCCTATATATGACCTGACTGCCATGAMGTGCCT 

GTGGACATGATGTCCTCGTMCACCC(^GGATGTGGCCAGGATACTCCCrO 

GTCrrCATTACTTTMGCTATTGCCAGATTGGMC 

ATGCCCCTCMCGGATGTACAGTGAMTC^^^ 

MTGCATTTACTTTTCMTTAAMGTTGCrrCCM 

[G,A] 

GTMCCMCMTGAGGTTGTCCCCCAGCACCCTGGGGGAGATGCACAGTGGAGTCTGT^ 
TCGAAGTCAATTGTGTTAGTGTTATTTAT^ 
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CAGGTCCTTATAMCAATGAGGTAGATTAGGCAAAAAG^ 

TGGCATTTAAGTCTAATTAAATTGTAA I I I I I AGGGCATACCATGAAGTATAGAAATGTC 
TGMGCTTCAAAGGMCAGTGAMTTCCTTTMGGTCCT 

20640 GACATCTTTGCATGGGACCATCTACAGGTCCTTATA^ 

AGATAAACMGTTGCrACrCTATCTGGCATTTMCrrCrMTTAMTTGrAA I I I I IAGGG 
(XTACCATQWjTATAGAMTCTCTGM 

CCrATATGGAAACCrCTGTTGT CATTTTATTTATATGGATTGCTATGGCAATGGACAGAG 
TGTGGGATTAGGAGGAGGGCCTGTAACI ICI I I ATAAAAG I I I CI I AGCTATCCTGAAGA 
[T,C] 

GTATAGACA I I I I I AU I I I I I AGGTATTTTCAACATCAGAM1TCAAAAMGTCCCCM 
. AGATTCTTCCAGAGAAGCCCTC I I I I C_ I I ACAATCTTATCCCTGGCTATCTGCGTAAACG 
GMTCTTGMCCCATMTAGGATACATGTATAAMTCTTCCrTATTAM 
TTGTACAGCATCAATATCATTTTATAATCATAGGGAGGCI ICI I IGI I I AGCATGTAATG 
CCCCCTTTACAGGL I I I I IG I ICI I I GAGGGGTTTGAACATTCCATGAAAAACTGACAGA 

21156 AGGCI ICI I IGI I I AGCATGTAATGCCCCCTTTACAGGL I I I I IGI ICI I IGAGGGGTTT 
GMCATTCCATGAAAMCTGACAGATAGGAMCTGACMTAAMGATTGAGCT 
GAAGCAGAMGTACTAGGCTAGATAGTCTCTAMCATTAAGTAI I I ICI ICCTCCATCTT 
AAMGCAATGAGMGCCACCAAMTATTTTAC 
GTMCCACCACTTTGGCTGCrACATAGAG^ 
[G,C] 

AGCMGTCTGTAMTCTGATCTV^GTGlTCTGATGCAGGCrGATATC 

GAGATGATCCTTGGAAAATCCAGAGCCAGCTCCATMTACTTTCCTGCTCTGCTGGC 

TCCACMGCrGCrGGCCCCTGGAGCCATTCTTCTCTCAAMCTAGCATTCATC 

TGTATACGTATTGATGGGGAATMTGGTCACTATGAAAACCATGTGATAATATGGAAAAA 

TACCCATGATATMTGTTATGTGMGAGMGAAAATGAAACTGGTAGAACTATGTGATTG 

21163 I I IGI I IAGCATGTAATGCCCCCTTTACAGGCI I I I IGI ICI I I GAGGGGTTTGAACATT 
CCATGAAAMCTGACAGATAGGAMCTGACMTAAMGATTGAGCTAMGATGGAAGCAG 
AAAGTACTAGGCTAGATAGTCTCTAAACATTAAGTA I I I ICI I CCTCCATCTTAAAAGCA 
ATGAGMGCCACCAAMTATTTTACCTAATGGAAACCTGATTGCCGCA I I I I IGTAACCA 
CCACTTTGGCTGCTACATAGAGMTGGATTAGMGATGCCMCAAMGATTCrGAGCAAG 
[A,T] 

CTGTAMTCrGATCMGTGTTCTGATGCAGGCTGATATCCTTCrGTGCTMGAGAGATGA 

TCCTTGGAAMTCCAGAGCCAGCTCCATMTACTTTCCT 

GCTGCTGGCCCCTGGAGCCATTCTTCTCTCAAMCTAGCATTCATCMTT^ 

GTATTGATGGGGMTMTGGTCACrATGAAMCCATGTGATMTATGGAAAAATACCCAT 

GATATMTGTTATGTGMGAGMGAAMTGAMCTGGTAGMCrATGTGATrGCAAATAT 

21425 MTGGATTAGAAGATGCCMCAAMGATTCTGAGCMGTCTGTAMTCTGATC 
CTGATGCAGGCTGATATCCTTCTGTGCTMGAGAGATGATCCTTGGAAM 
GC^CCATMTACTTTCC^GCTC^GCTGGCAMTCG^CMGC^GCTGGCCCCTGGAGCG^T 
TCTTCTCTCAAMCTAGCATTCATCAATTTMTGTATACGTATTGATGGGGMTM 
G^CrATGAAMCG^TGTGATAATATGGAAAMTACCCATGATATAATGTTATGTGAAGAG 
[G,A] 
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AGAAMTGAMCTGGTAGAACTATGTGATTGCAMTATA 
ATGACTTTATAAMTATTTGTATATMTGAAMCTGM 

TTGTGTCAGGGT AGTAACATGATGAGTGATTAATAG I I I I IAAI I I I I AATATAGTAATG 

ACATMTGTTACMCTTCTC^^ 

ATAAMGMTACATATTTTATTATACATTT^ 

Chromosome map: 
Chromosome 10 
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